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Methods Of Modifying Carbohydrate Moieties 

Field of the Invention 

The invention is directed to a method for converting glycosylation patterns 
on proteins and cells to a hybrid or complex pattern. This is especially useful for 
recombinantly-produced, therapeutic proteins that are to be administered to a 
mammalian, especially human, patient. 

Background of the Invention 

Glycosylation aflfects many properties of a glycoprotein, including proper 
protein folding, protease resistance/sensitivity, intracellular trafficking and 
compartmentalization, secretion, inter- and intra-molecular associations, 
intermolecular affinities, tissue targeting and biological half-life. Glycosylation 
patterns may also significantly alter the biological activity, solubility, clearance, 
intermolecular aggregation and antigenicity, especially of those proteins that are 
administered therapeutically in vivo. 

Unfortunately, the recombinant production of many proteins can greatly 
alter their glycosylation pattern. Proteins expressed in bacteria are completely 
unglycosylated. Baker's yeast glycosylation patterns are not equivalent to 
mammalian counterparts and are highly antigenic in mammals (Ballou, C.E., 1982. 
In Strathern, J.N. et al (eds). The Molecular Biology of the Yeast 
Saccharomyces. Metabolism and Gene Expression. Cold Spring Harbor 
Laboratory Press, New York, pp. 335-360). 

Similarly, non-complex glycosyl moieties are observed when insect cells, 
such as the popular Sf9 baculovirus expression system, are used to express 
mammalian proteins (Davidson et al, Biochemistry, 29:2828-2838 (1990); Bahl, 
O.P. et al., in Cell Surface and Extracellular Glycoconjugates - Structure and 
Function, D.D. Roberts et al., eds., Academic Press, Inc. 1993 pp. 245-270). 
High mannose (Man) and hybrid oligosaccharides are synthesized on proteins 



WO 96/21038 



PCT/FI96/00002 



.2 

produced in insect cells. The primary structures of these glycans are not different 
from those found on certain mammalian proteins such as ribonuclease B, 
thyroglobin, tissue plasminogen activator (the latter one has a mixed population 
of high mannose and complex structures at its glycosylation sites). 

Small mammalian cefls cultures are practical for use in laboratory 
mammalian gene expression experiments. However, the cost and difficulties of 
expressing large quantities of therapeutic recombinant glycoproteins in mammalian 
cell cultures are prohibitive. Therefore, there is a need for a host that is capable 
of producing large quantities of recombinant proteins that possess, or can be 
modified to possess, a glycosylation pattern that is similar to that produced by 
mammalian cells. 

Filamentous fungi, for example, Trichoderma, have certain advantages as 
recombinant hosts. It is easy to grow large amounts of these fungi and they have 
the ability to glycosylate and efficiently secrete large yields of recombinant 
mammalian proteins into the medium, making isolation relatively easy. In 
addition, the glycosylation pattern on such proteins is more similar to humans than 
that of baker's yeast. However, there are still differences, which places wild-type 
fungal glycosylation patterns at a structural and functional disadvantage when in 
vivo efficacy of a protein requires a specific type of glycosylation. For example, 
terminal sialic acid residues are important to the functioning of a protein in a 
mammalian system, as they impede glycoprotein clearance from the mammalian 
bloodstream. The mechanism behind the increased biologic half-life of sialylated 
molecules is believed to lie in their decreased recognition by lectins (Drickamer, 
K, X BioL Chent 263:9557-9560 (1988)). However, fungal cells are not capable 
of adding such units. Glycoproteins synthesized in fungal cells are asialic. 

Another disadvantage presented by the inability to generate complex 
glycoprotein moieties lies in the presence of terminal mannose residues. 
Glycoproteins terminating in mannose residues are ligands for mannose-binding 
proteins on macrophages and cells from the reticulo-endothelial system 
(Ezekowitze/o/., CellScL SuppL 9:121-133 (1988)). While this can be useful 
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in some limited instances, as for targeting purposes, this has an undesirable 
pharmacological consequence for most recombinant proteins — e.g., the rapid 
clearance of the compound from the blood. Rapid clearance may deleteriously 
effect the administered agent's pharmacokinetics and decrease its therapeutic 
potential or increase its toxicity. Thus, recombinantly produced non-complex 
glycosylation patterns hamper the use of recombinantly made glycoproteins for 
human therapeutic use (Sareneva,T. et al. t Interferon Res. 73:267-269 (1993)). 

These disadvantages may block the biological or pharmacologic usefulness 
of mammalian proteins generated using fungal transformation systems. Therefore, 
there has been a long-felt need in the glycoprotein industry to devise an (in vitro) 
glycosylation system that would provide complex type glycosylation patterns 
similar to those found on proteins from the higher eukaryotes. 

Summary of the Invention 

Recognizing the need for the large scale, economical production of 
recombinant proteins that have a mammalian glycosylation pattern, the inventors 
have investigated glycosyl residue processing in mammalian and fungal cells. 
These studies have led to the discovery that filamentous fungi such as 
Trichoderma are capable of secreting a desired protein in an immature 
glycosylated precursor form that is amenable to in vitro processing to a hybrid 
type mammalian glycosylation pattern. This discovery has resulted in the 
development of methods of preparing hybrid and complex protein glycosylation 
patterns, similar to those found in mammalian hosts. 

Accordingly, in a first embodiment, the glycosylation pattern of a desired 
glycosylated protein (glycoprotein) is sequentially modified by reaction with N- 
acetylglucosaminyl-transferase I (GlcNAc Tr I), galactosyltransferase and 
sialyltransferase, so as to produce a protein having a hybrid-type glycosylation 
pattern with terminal sialic acid(s) similar to that of mammalian cells. 
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Accordingly, in a further embodiment, the glycosylation pattern of a 
desired glycosylated protein is sequentially modified first by reacting such protein 
with a-l,2-mannosidase, to improve reaction with GlcNAc Tr I, 
galactosyltransferase and sialyltransferase, so as to produce a hybrid type 
glycosylation pattern with a terminal sialic acid residue(s) similar to that of 
mammalian cells. 

Accordingly, in a further embodiment, glycosylated proteins are expressed 
in, and preferably secreted from, a fungal host that has been transformed with 
GlcNAc Tr I, and modified by sequential reaction with galactosyltransferase and 
sialyltransferase so as to produce a hybrid-type glycosylation pattern with terminal 
sialic acid, a structure similar to that of mammalian cells. 

Accordingly, in a further embodiment, the glycosylation pattern of the 
desired glycosylated protein produced in a fungal host, preferably Trichoderma, 
that has been transformed with a-l,2-mannosidase and/or GlcNAc Tr I, is 
modified by reacting such protein sequentially with GlcNAc Tr I, if necessary, and 
with galactosyltransferase and sialyltransferase, so as to produce a protein having 
a hybrid-type glycosylation pattern with a terminal sialic acid structure similar to 
that of mammalian cells. 

Accordingly, in a further embodiment, the immature glycosylated form of 
a desired glycosylated protein is produced in a fungal host, preferably 
Trichoderma, that has been transformed with a recombinant gene encoding a- 1,2- 
mannosidase and/or GlcNAc Tr I, preferably human GlcNAc Tr I, and is further 
modified by in vitro reaction with non-specific mannosidase, so as to produce a 
protein having a mono-antennaiy complex glycosylation pattern in one in vitro 
step. 

Accordingly, in a further embodiment, any of the above proteins having a 
hybrid type glycosylation pattern are converted to a complex pattern by reactions 
with an a 1,2-; a 1,3-; and/or a 1,6- mannosidase. 
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Brief Description of the Figures 

Figure 1 . Figure 1 is a diagram that illustrates the conversion of high 
mannose oligosaccharides (glycosyl structures) to hybrid oligosaccharides after the 
reaction of high mannose structures with a-1,2 mannosidase, GlcNAc Tr I, P-1,4 
galactosyl transferase (shown as GalTr), and ot-2,6 sialyltransferase (shown as 
NeuNAc T). The conversion of fungal glycoprotein high mannose 
oligosaccharides (glycosyl structures) to hybrid structures occurs as a result of this 
illustrated series of reactions. 

Figure 1 A. Figure 1 A is a diagram that illustrates the conversion of hybrid 
oligosaccharides (glycosyl structures) to complex type oligosaccharides after the 
reaction of hybrid structures with an aspecific (non-specific) a-mannosidase. 

Figure 2. Figure 2 shows a map of plasmid pCAGGS. The major 
divisions are lengths of 1000 bp, the subdivisions are lengths of 200 bases. The 
length is approximately 4811 bp. AG:P-actin/p-globin hybrid promoter; 
SVORLbidirectional origin of SV40; RBS: lac operon ribosome binding site; 
polyA: polyA signal of the SV40 early region (clockwise arrow) or late region 
(counterclockwise arrow); Ori: Eco-pMBI origin or plasmid replication; AMP: 
ampillicin resistance gene; CMV: CMV-DE enhancer; lac: lac promoter, rGBf 
(plasmid bases 10-182): partial exon 3 and 3' untranslated region and poly A site 
of the rabbit P-globin gene; 3FR (plasmid bases 183-541): 3'-flanking region of 
rabbit p-globin gene. 

Figure 3. Figure 3 shows a map of plasmid pSCGALlMF3. The major 
divisions are lengths of 500 bp, the subdivisions are lengths of 100 bases. The 
length is approximately 3477 bp. AMP: arapicillin resistance gene; ORI: origin of 
plasmid replication (Eco-pMBI); lac: lac promoter; RBS: lac operon ribosome 
binding she; GALl: Gall promoter, PREMF: a-mating factor 1 prepro sequence; 
lacZ: LacZot gene. 

Figure 4. Figure 4 shows a map of plasmid pCAMFhGNTIf,, containing 
a human gene fragment encoding N-acetylglucosaminyltransferase I (hGlcNAc-Tr 
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I). The major divisions are lengths of 1000 bp, the subdivisions are lengths of 200 
bases. The length is approximately 6223 bp. AMP: ampicillin resistance gene; 
ORI: origin of plasmid replication (Eco-pMBl); lac: lac promoter, RBS: lac 
operon ribosome binding ate; PREMF: o-mating factor 1 prepro sequence; rGBf 
(plasmid bases 1-170): partial exon 3 and 3' untranslated region and poly A site 
of the rabbit p-globin gene; 3FR (plasmid bases 171-530): 3 '-flanking region of 
rabbit P-globin gene; SVORI: bidirectional origin of replication of SV40; poly A: 
poly A signal of the SV40 early (clockwise arrow) or late region 
(counterclockwise arrow); CMV: CMV-IE enhancer, Ac: chicken P-actin 
promoter. 

Figures 4A, 4B, 4C and 4D show the sequential construction of 
pCAMFhGNTIfl. All abbreviations are as above. Figure 4 A diagrams the 
insertion of the hGNTI fragment into pUC18; figure 4B diagrams the extraction 
of the GAL 1 -PREMF fragment from pSCGALlMF3 and its insertion into pUC18; 
Figure 4C diagrams the construction of the pUC18 plasmid that contains the 
hGNTI sequence (without its signal sequence) after the PREMF signal sequence; 
and Figure 4D diagrams the final construction of pCAMFhGNTIfl by inserting the 
PREMF-hGNTI fragment that had been taken from the vector constructed in 
Figure 4C into pCAGGS. 

Figure 5. Figure 5 shows an autoradiogram that was obtained after 
exposure of an X-ray film to radiolabelled glycoproteins that had been separated 
on an SDS-polyaciylamide gel electrophoresis (SDS-PAGE). The following 
GHcNAc Tr I substrates were added as noted: Lane 1, no substrate protein added; 
lane 2, human transferrin added as a negative control; lane 3, S. cerevisiae 
invertase added as a negative control; lane 4, ovalbumin added as a positive 
control; lane 5, cellulases from Trichoderma; lane 6, proteins secreted from T. 
reesei mutant RUTC 30. 

Figure 6. Figure 6 shows an autoradiogram that was obtained after 
exposure of an X-ray film to radiolabelled oligosaccharide structures that had been 
separated on a thin layer chromatography plate (TLC). The following were added 
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to the reaction mixture containing GIcNAc Tr I: Lane 1, no oligosaccharides 
added to reaction; lanes 2 and 3, high mannose oligosaccharides from ribonuclease 
B hydrazinolysis, Oxford Glycosystems, (lane 2), and, Man 5 Gn from mannosidosis 
urine, Sanbio, (lane 3), were reacted as positive controls; lane 4, oligosaccharides 
isolated from Trichoderma cellulases; lane 5, oligosaccharides isolated from 
secreted proteins from T. reesei mutant RUTC 30; lane 6, glycosyl structures 
released from 51 cerevisiae invertase as negative control. 

Figure 7. Figure 7 shows an autoradiogram that was obtained after 
exposure of an X-ray film to radiolabeled glycoproteins that had been separated 
on 12.5% SDS-PAGE. Lane 1, the incorporation of radioactive 2-deoxy-2-N- 
acetylamino-D-glucose (GIcNAc) on CBH I is shown; lane 2, a clear 
electrophoretic shift of CBH I was the consequence of the incorporation of 
radioactive sialic acid is shown. 

Figure 8. Figure 8 shows the electrophoretic resolution of ANTS- 
oligosaccharides prepared from T. reesei RUTC 30 glycoproteins. Lane 1: the 
CBH I digest; lane 2: CBH I oligosaccharides treated with A. saitoi a- 1,2- 
mannosidase; lane 3: digest from a mixture of T. reesei RUTC 30 secreted 
protons; lane 4: analogous to lane 3, but with <x-l,2-mannosidase treatment; lane 
5: bovine ribonuclease B digest; M: molecular weight markers (a:maltotetraose; 
b:maltopentaose; c:maltohexaose; dimaltoheptaose; e:maltooctaose; 
fmaltononaose). 

Figure 9. Figure 9 shows an autoradiogram of a thin layer chromatogram 
as above, separating free oligosaccharides pretreated with a- 1,2 mannosidase and 
subsequently treated with GIcNAc Tr I and UDP-( w C)GlcNAc (uridine 
diphospho-2-deoxy-2-N-acetylamino-D- ,4 C-glucose). To the GIcNAc Tr I 
reaction mixtures, the following were added: Lane 1, oligosaccharides were not 
added; lane 2, Ma^GlcNAc (Sanbio), an acceptor substrate for GIcNAc Tr I, was 
added as a positive control; lanes 3 and 4, respectively, S. cerevisiae invertase 
oligosaccharides not pretreated and pretreated with a- 1,2 mannosidase; lanes 5 
and 6, respectively, oligosaccharides from Trichoderma cellulases (Fluka, Buchs, 
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Switzerland) not pretreated and pretreated with a- 1,2 mannosidase; lanes 7 and 
8, respectively, oligosaccharides from T. reesei mutant RUTC 30 not pretreated 
and pretreated with a- 1,2 mannosidase. 

Figure 10. Figure 10 shows an autoradiogram of an SDS-PAGE 
separating glycoproteins reacted with GlcNAc Tr I and UDP-( l4 C)GlcNAc, with 
or without M pretreatment M with A. Saitoi a- 1,2 mannosidase. Lanes 1 and 2, 
respectively, no protein acceptor substrate without and with a-1,2 mannosidase 
treatment; lanes 3 and 4, respectively, transferrin without and with pretreatment 
with a-1,2 mannosidase; lanes 5 and 6, respectively, S. cerevisiae invertase 
without and with pretreatment with a-1,2 mannosidase; lanes 7 and 8, 
respectively, ovalbumin without and with pretreatment with a-1,2 mannosidase; 
lanes 9 and 10, respectively, Trichoderma cellulases (Fluka, Buchs, Switzerland) 
without and with pretreatment with a-1,2 mannosidase. / 

Figure 11. Figure 1 1 shows an autoradiogram of a 12.5% SDS-PAGE gel 
containing various proteins reacted with GlcNAc Tr I and UDP-( H C) GlcNAc. 
Lane 1 is a-amylase from Aspergillus oryzae. Lane 2 is a-amylase from 
Aspergillus niger. Lane 3 is an ovalbumin (positive control). Lane 4 is no protein 
(negative control). Lane 6 is papain digested cellobiohydrolase I from 
Trichoderma reesei RUTC 30. Lane 7 is a galactosidase isolated from 
Aspergillus oryzae. The markers (M) are "rainbow" [ U C] labeled marker proteins 
from Amersham, Buckinghamshire, UK. 

Figure 12. Figure 12 shows an autoradiogram of a thin layer 
chromatograph demonstrating formation of hybrid structures at the 
oligosaccharide level. Lane 1 shows * GlcNAc transferred to Man^ GlcNA^ from 
ribonuclease B. Lane 2 shows *Gal linked to *GlcNAc on commercial Man 5 
GlcNAc^ Lane 3 shows *sialic acid transferred to *Gal linked to *GlcNAc on 
commercial Manj GlcNAcj. Lanes 4, 5, and 6 are analogous to lanes 1, 2, and 3, 
respectively, but the acceptor substrates are N-glycans liberated from proteins 
secreted by Trichoderma reesei RUTC 30. 
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Detailed Description of the Preferred Embodiment 

In the following description, reference will be made to various 
methodologies known to those skilled in the art of molecular genetics, 
microbiology and general biology. Publications and other materials setting forth 
such known methodologies to which reference is made are incorporated herein by 
reference in their entireties as though set forth in full. 

General principles of glycoprotein biology and of glycoprotein structural 
analysis are set forth, for example, in Glycoprotein Analysis in Biomedicine, 
edited by E. F. Hounsell, Humana Press, Totowa, New Jersey (1993), and in Cell 
Surface and Extracellular Glycoproteins, Structure and Function, edited by D.D. 
Roberts and R.P. Mecham, Academic Press, Inc., San Diego, CA (1993). 

General principles of the biochemistry and molecular biology of the 
filamentous fungi and Trichoderma are set forth, for example, in Finkelstein, D.B. 
et ah, eds., Biotechnology of Filamentous Fungi: Technology and Products, 
Butterworth-Heinemann, publishers, Stoneham, MA (1992), and Bennett, J.W. et 
al t More Gene Manipulations in Fungi, Academic Press - Harcourt Brace 
Jovanovich, publishers, San Diego, CA(1991). 

L Definitions 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
which this invention belongs. In the description that follows, a number of terms 
used in glycoprotein technology are extensively utilized. In order to provide a 
clear and consistent understanding of the specification and claims, including the 
scope to be given such terms, the following definitions are provided. 

"Glycosylation pattern" refers to a characteristic structure, number or 
location of oligosaccharide structures associated with a macromolecule, such as 
a protein, or typical of a specific cell type or species. 
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"Wild-type glycosylation pattern" refers to the characteristic native 
structure, number or location of oligosaccharide structures that are associated 
with a macromolecule, such as a protein, or found on a specific cell type or 
species. 

"0-linked M oligosaccharides are those oligosaccharides that are linked to 
a peptide backbone through threonine or serine. 

"N-linked" oligosaccharides are those oligosaccharides that are linked to 
a peptide backbone through asparagine, by way of an asparagine-N- 
acetylglucosamine linkage. The N-linked oligosaccharides are also called "N- 
glycans." All N-linked oligosaccharides have a common pentasaccharide core of 
Man 3 GlcNAc2 (MannoseaGlucoseNAcetylj) (Glucose can be abbreviated as Glc 
and Glue). They differ in the presence of, and in the number of branches (also 
called antennae) of peripheral sugars such as fucose and sialic acid. 

N-linked oligosaccharides are categorized according to their branched 
constituents. If the branched constituent is only mannose, the oligosaccharide is 
said to be a "high mannose #-g!ycan." As used herein "high mannose" or "high- 
mannose type" oligosaccharide or glycosyl structure means an oligosaccharide 
with additional a-mannose residues linked to the outer core structure 
Man 3 GlcNAc 2 , and no fucose or sialic acid residues at the free ends of the 
oligosaccharide branches. 

A "complex" or "complex-type" oligosaccharide or glycosyl structure 
means the structure of an oligosaccharide with typically two to six outer branches 
with a sialyllactosamine sequence linked to an outer core structure Ma^GlcNAc^ 
A complex N-glycan has at least one branch, and preferably at least two, of 
alternating GlcNAc and galactose (Gal) residues that terminate in oligosaccharides 
such as, for example: NeuNAc-; NeuAca2-6GalNAcal-; NeuAca2-3Gaipi- 
3GalNAcal-; NeuAca2-3/6Galpl-4GlcNAcpl-; GIcNAcal-4Gaipi- (mucins 
only); Fucal-2Gaipi- (blood group H); 
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NeuAca2-3Galp 1 -3GlcNAcp 1 -; 

2,6 

NeuAco 

GalNAcal 

\ 

3 

Gaipi - (Blood group A) 
2 

/ 

Fucol 



Galctl 

\ 

3 

Gaipi - (Blood group B) 
2 

/ 

Fucal 



Fucal 

\ 

4 

GlcNacpi -(Le b ); 
3 

/ 

Fucal -2 Gal pi 
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Fucal-2Galpl 

\ 

4 

GIcNacpi-CLeO; 
3 

/ 

Fucal 



Fucal 

\ 

4 

GlcNacpl - (Sialyl Le'/Le'); 
3 

/ 

±NeuAcec2-3Gaipi 



Fucal 

\ 

3 

GlcNacpl - (Sialyl LeYLe"); 
4 

/ 

±NeuAca2-3Gaipi 



GalNAcpi 

\ 

4 

Galpl-(SD') 
3 

I 

NeuAco2 
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Sulfate esters can occur on galactose, GalNAc, and GlcNAc residues, and 
phosphate esters can occur on mannose residues. NeuAc (Neu: neuraminic acid; 
Acracetyl) can be Oacetylated or replaced by NeuGl (N-glycolylneuraminic acid). 
Complex N-glycans may also have intrachain substitutions of bisecting GlcNAc 
and core fiicose (Fuc). 

"Hybrid N-glycans" have only mannose residues on the Manal-6 arm of 
the core and one or two complex antennae on the Manal-3 arm. 

An "acceptor substrate" is an oligosaccharide, the specific structure of 
which is recognized as a substrate by an enzyme, such that reaction with such 
enzyme adds certain sugar residues to said acceptor substrate. For example, the 
oligosaccharide Ma^GIcNACj is an acceptor substrate for GlcNAc Tr I, the first 
glycosyltransferase involved in complex carbohydrate formation. 

By an "immature precursor" glycosylation pattern, orN-glycan structure 
is meant a high mannose structure. According to the invention, "immature" 
structures can be "matured" to a hybrid or complex type structure by treatment 
with enzymes that add the appropriate units of the hybrid or complex-type 
oligosaccharides, with or without the removal of the mannose residues as 
necessary. 

By "sequentially modified" is meant that an oligosaccharide (glycan) 
structure is enzymatically modified in an ordered manner, involving two or more 
enzymatic reaction such that at least one reaction precedes the other. This is also 
referred to as a "cascade" series of enzymatic reactions. 

By "reacting a protein with an enzyme" is meant that the protein is 
provided in a reaction mixture that contains all the required components for 
catalyzing the enzyme's inherent activity to modify such protein. 

Linkage between monosaccharides units in a glycan can be to any of the 
hydroxyl groups with either a P or an a anomeric configuration. When drawn as 
shown below (for example at carbon 1 ("1" below) of GlcNAc and sialic acid), the 
p or a configuration is depicted as a line above or below the plane of the 
monosaccharide ring, respectively. 
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The pyranose form of P-D-N-acetylglucosamine (GlcNAc) is: 




The structure of sialic acid is: 




where R=CH r CO-(N-acetylneuraminic acid) 3 or CHOH-CO-(N- 
glycolylneuraminic acid); the hydroxyl groups can be substituted with different 
acyi substituents and those at C8 and C9 with additional sialic acid residues. 

Certain abbreviations are used herein as are common in the art, such as: 
"Ac" for acetyl; "glc H for glucose; "foe" for fiicose; "GlcNAc" for N- 
acetylghicosamine; "man" for mannose; "PNGase F w for peptide N-glycosidase F 
(EC 3.2.2.18). 

As used herein, "OHcNAc Tr F means N-acetylglucosaminyl-transferase I. 



WO 96/21038 



PCT/F196/00002 



15 



//. The Glycosylation Pathway 



In glycoproteins, potential N-glycosylation sites contain the sequence 
asparagine-X-serine (or threonine), where X can be any amino acid except proline. 
Proteins can be engineered to contain artificial N-glycosylation sites, or the natural 
site may be used. 

The methods of the invention will be better understood by reference to the 
glycosylation pathway as described below. It is believed that the early events of 
N-glycan synthesis that are executed in the endoplasmic reticulum are identical in 
yeasts, plants and higher eukaryotes and fungi. The pathway of N-glycosylation 
of proteins in eukaryotic cells generally follows this pattern. First, a lipid-Iinked 
precursor oligosaccharide (I): 



ctl.2 
Man Man 




81 4 614 
Man -S^-GlcNAc -i-^-GlcNAc Lipid 



al,2 al3 al,2 m a\2\ m 

Glc--— Glc — ^-Glc — —Man — —Man— — Man 



is transferred from the lipid and attached an Asn on the target protein through the 
action of an oligosaccharyltransferase to form (II): 
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al.2 
Man Man 



Man Man 




61 4 

GlcNAc -^-^-GlcNAc — Protein 



This is also abbreviated: GlCaMa^GlcNACj-protein. The enzyme a- 1,2 
glucosidase I then removes the terminal glucose residue, and ct-1,3 glucosidase II 
removes the second and third glucose residues to leave (HI): 



al,2 
Man Man 



M cxl.2 ' \ . . (DD 

Man Man 



ctl,2 al,2 
Man — — Man Man 




61 4 

GlcNAc -i—^-GlcNAc Protein 



This is an example of a "high-mannose type" glycosylation. This is also 
abbreviated: Mai^GlcNACj-protein. 

The final step in the rough endoplasmic reticulum is the removal of one 
mannose residue. Remaining steps are performed in endoplasmic reticulum (ER) 
and the Golgi apparatus. Up to this point, the glycosylation pathway of both yeast 
and mammalian cells is the same. However, steps in the cis-Golgi differ between 
yeast and the higher eukaryotes. Yeast add mannoses by the action of 
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mamiosyltransferases, the result of which is a high mannose type glycosylation in 
the glycosylated product. However, higher eukaryotes remove three additional 
mannoses, with cc-mannosidase I; in some cases, this trimming also takes place in 
the ER. The result is the product (IV): 



Man 




(IV) 



Man-^GIcNAc-^-GlcNAc — Protein 



Man 



This is also a "high mannose" structure and is abbreviated: ManjGlcNAc^-protein. 
The high energy sugar nucleotide, UDP-CHcNAc, serves as a substrate for GlcNAc 
Tr I to transfer a GlcNAc moiety to (TV) to produce (V): 



Man 




(V) 



Man -^^-GlcNAc-J^-GlcNAc Protein 



Glc NAc Man 



/al,3 
„ / 
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This is a hybrid-type structure and is abbreviated: GlcNacManjGlcNAc^protein. 
a-Mannosidase II recognizes this substrate and removes additional mannose 
residues to result in the "hybrid type" structure (VI): 



Gic NAc Man 



Man 

a\ t 6 

/ 



\ BL4 01,4 (VD 

Man-^-GlcNAc-i-^-GlcNAc — Protein v 



This is abbreviated: GlcNacMan 3 GlcNAcj-protein. This can serve as a substrate 
for any of UDP-GlcNAc, UDP-Gal or CMP-sialic acid (SA), and GlcNAc- 
transferase n, galactosyltransferse or sialyltransferase, respectfully, to yield a 
"complex type" glycoslyation pattern, for example, as shown below (VII): 

Neu-NAc— Gal— Glu-NAc^-Man 

al,o / 

'814 814 
Man— GlcNAc— GlcNAc — Protein 

\al,3 

Neu-NAc^- 6 Gal P -^GIu-NAc^-Man 



III. Glycosylation Modification Enzymes 

The glycosylation modification enzymes that are useful in the methods of 
the invention include transferases and mannosidases. Mannose residues that are 
linked by an al,2 linkage are those mannose residues that can be removed in vivo 
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or in vitro by a "specific" ctl,2 mannosidase, such as that of Aspergillus saitoi. 
GlcNAc Tr I is very specific and only transfers GlcNAc to the less periferic (de- 
linked mannose of a Man 3 GlcNAc 2 structure (an appropriate substrate is shown 
as structure IV, above). After transfer of GlcNAc, galactose and sialic acid to 
Man 3 GlcNAc2, the remaining al,3 and al,6-linked mannose residues can be 
removed in vivo or in vitro using an "aspedfic" mannosidase such as, for example, 
the commercially available Jack bean mannosidase, thus converting a hybrid 
structure to a complex one. 

Transferases such as the GlcNAcTr I, galactosyltransferase and 
sialyltransferase exemplified herein, catalyze the transfer of a monosaccharide 
from a high-energy sugar donor to an acceptor oligosaccharide. The generalized 
reaction is: sugar nucleotide + acceptor => acceptor sugar + nucleotide. 

To follow the reaction in vitro y a sugar nucleotide can be provided in a 
form that will label the acceptor. For example, the specific activity of the 
commercial radiolabeled sugar nucleotides is sufficiently high to allow detection 
of fmoles of acceptor substrate in a radiolabeled assay. 

Neither glycosyltransferases nor the sugar nucleotide substrates are 
permeable to cell membranes. This allows the extracellular alteration of cell 
surface glycosylation on intact cells or sealed membrane preparations, using the 
method of the invention in those embodiments wherein the GlcNAc Tr I reaction 
does not occur within the host cell. 

GlcNAcTr I (EC 2.4.1.101) is the medial Golgi transferase that initiates 
complex N-linked carbohydrate formation. This enzyme is also known as UDP-N- 
acetylglucosamine:a-3-D-mannoside p 1,2-N-acetylgIucosaminyltransferase I. 
The human gene encoding this enzyme has been cloned and its coding sequence 
published (Kumar, R. etaU Proc. Natl Acad Set, USA 57:9948-9952 (1990). 
Other GlcNAc Tr I sources include rabbit (Sarkar, M. et al 9 Proc, Natl Acad. 
Set USA 55:234-238 (1991) and mouse (Kumar, R. etal t Gfycobiology 2:383- 
393 (1992)). 
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Other enzymes useful in the construction of the hybrid oligosaccharide, as 
shown in Figure 1, are also available and have been cloned. For example, a non- 
limiting list includes bovine P(l,4)galactosyl transferase (D'Agostaro, G. et al, 
Eur. J. Biochem. 7*3:211-217 (1989), human P(l,4)galactosyl transferase (Masri, 
K.A. et al., Biochem. Biophys. Res. Common. 757:657 (1988), rat 
o(2,6)sialyltransferase (Wang, X.C. et al., Gfycobiology 7.25-31 (1990), and 
rabbit al,2 mannosidases (Lai, A. etal, J. Biol. Chem. 269:9872-9881 (1994)) 
or with mouse al-2 mannosidases (Herscovics, A. et al, J. Biol. Chem. 
269:9864-9871 (1994). 

To clone a reported sequence, primers may be designed based upon the 
reported sequence. For example, primers such as those of SEQ ID No: 1 and 
SEQ ID No: 2 can be used to amplify the coding sequence of human GlcNAc Tr 
I, for example, using PCR techniques, so as to insert such coding sequence into 
any vector of choice, and preferably an expression vector capable of expression 
of such coding sequence in a desired host. 

The GlcNAc Tr I enzyme is not glycosylated. Mouse GlcNAc Tr I has 
been expressed by Kumar, R. et al, Gfycobiology 2:383-393 (1992). It is 
possible and useful to express mouse and human GlcNAc Tr I intracellularly in 
bacteria, but it may form protein aggregates. For this reason, intracellular 
production in bacteria is not the preferred method for the production of GlcNAc 
Tr I. The enzyme can be produced in mammalian cells such as COS cells, be 
secreted into the medium, and used directly in that medium if desired; it should be 
noted that use of the enzyme in the spent growth medium is simply for 
convenience; the cell's growth medium environment is not necessary per se for 
GlcNAc Tr I activity as the enzyme produced in bacteria was active. Isolation 
methods for the enzyme from rabbit liver are also known (Nishikawa, Y. et al, J. 
Biol Chem. 263:8270-8281 (1988)). 

In the methods of the invention, GlcNAcTr I reactions are preferably run 
at 37°C. One unit is that enzyme activity that catalyzes transfer of 1 umol 
GlcNAc from UDP-GicNAc to Man,GlcNAc within 1 minute at 37° C and pH 6. 1 
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As exemplified, 330 \imo\ acceptor substrate and 340 pmol UDP-GlcNAc were 
used with 10-15 fil of a 10-fold concentrated COS supernatant (containing about 
30 microunits GlcNAc Tr I in a reaction mixture with a final volume of 30 pi. 
One unit of enzyme activity converts 1 pmol substrate in one minute. Reaction 
times will vary depending upon the concentration and specific activity of the 
enzyme, but generally 5-10 hours, and, as exemplified, 1-4 hours, are sufficient for 
the purposes of the methods of the invention. The GlcNAc Tr I reaction can be 
run for a longer time as the scale of the reaction is increased (Ichikawa et al y 
Anal. Biochem. 202:215-238 (1992)). 

The enzymes do not need to be pure to be useful, as exemplified using the 
COS cell medium as a source of GlcNAc Tr I. However, large scale isolation of 
the glycosyltransferases can be achieved using nucleotide-affinity adsorbents as 
known in the art. The glycosyltransferase may be modified by the addition of an 
affinity tag (such as the streptavidine-tag or a histidine-tail) in order to enhance 
purification and reduce purification costs. 

P-1,4 Galactosyl transferase, EC 2.4.1.38, such as that from human milk 
exemplified herein, is commercially available from Boehringer Mannheim (catalog 
no. 1088-696), and Sigma Chemical Co., St. Louis, Missouri. Its purification 
from bovine milk was reported by Barker, R. et al. f J. Biol Chem. 247:7135 
(1972). In the presence of a-lactalbumin, EC 2.4.1.38 becomes EC 2.4.1.22, 
lactose synthetase (also known as lactose synthase and UDP-D-galactose: D- 
glucose 4P-galactosyltransferase), and accepts glucose as a substrate for the 
transfer of galactose to produce lactose (Yoon et al, Gfycobiology 2:161-168 
(1992)). This enzyme is commercially available from Sigma and Oxford 
Glycosystems. It is also easy to purify (Barker, R. et al, J. Biol. Chem. 247:7135 
(1972)). Bovine galactosyl transferase can be used in the same way and under the 
same reaction conditions as that from human. Cloned P-1,4 galactosyl transferase 
has been expressed in several systems (see, Masibay, A. et al. t Proc. Natl Acad 
Sci. USA 56:5733-5737 (1989), Aoki, D. et al. t EMBO J. 9:3171 (1990) and 
Krezdorn, C. etal. r Eur. J. Biochem. 272:113-120 (1993)). 
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The enzymatic reaction conditions of p-1,4 Galactosyl transferase, EC 
2.4.1.38, are known in the art (Parchment, R.E. et al, Anal Biochem. 754:460- 
469 (1986); Nunez, H. and Barker, R, Biochemistry 79:489-495 (1980)). The 
enzyme requires a pH of 8-8.5 and 1-50 mM manganese, preferably 20 mM, for 
optimal enzymatic activity. Reactions are preferably run at 37°C, with about 65 
mU enzyme and 5 umol UDP-galactose per ml. One unit of enzyme activity is the 
enzyme activity that catalyzes transfer of 1 umol galactose from UDP-galactose 
to glucose in the presence of a-lactalbumin, in 1 minute at 37°C and pH 8.4. 
However, for commercially available galactosyl transferase (5.5 units per mg), in 
the absence of a-lactalbumin and with GlcNAc as the substrate, the enzymatic 
activity may be reduced, for example, to about 3.5 U/mg. Reaction times will vary 
depending upon the concentration and specific activity of the enzyme, but 
generally 6-10 hours for small scale, and, as exemplified, 8 hours, are sufficient for 
the purposes of the methods of the invention. Longer times may be necessary as 
the scale of the reaction is increased. 

o-2,6 Siaryltransferase, EC 2.4.99. 1, such as that from rat liver exemplified 
herein, is commercially available from Boehringer Mannhein (catalog no. 981- 
583), and from Genzyme, Calbiochem, and Sigma. Its purification was reported 
by Weinstein, J. etal., J. Biol. Chem. 257:13,835-13,844 (1982). The enzyme is 
also known as CMP-N-acerymeuraminite:P-D-galactosyl-l,4-N-acetyl-P-D- 
glucosamine o-2,6-N-ac^lneuraminyltransferase. The K,, for the sugar donor 
CMP-NeuAc is 8.5 x 10 S M. The acceptor sequence is Gaipi,4GlcNAc-R. 
Purification methods for sialyltransferases involved in N-glycosyl synthesis are 
known (Weinstein, J. et al, J. Biol. Chem. 262:17735-17743 (1987)). 
Recombinant production of sialyltransferases has been reported in mammalian 
COS and CHO cells (Schachter, H. et al, in "Molecular Glycobiology," Fukuda, 
M. et al, eds., IRL Press, pp. 88-162 (1994). The cloning of human a-2,6 
siaryhransferase was reported in Grundmann, U. et al, Nucleic Acids Res. 75:667 
(1990). 
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One unit of cc-2,6 sialyltransferase will transfer 1 pmol N-acetylneuraminic 
acid from CMP-N-acetylneuraminic acid to asialo-a r glycoprotein in 1 minute at 
37°C (Weinstein, J. etal, J- BioL Chent 257:13835-13844 (1982)). The enzyme 
requires a pH of 6-6.5 for optimal activity, and 50 mM NaCl is generally added 
to the assay to stabilize the enzyme's activity. Reactions are preferably run at 
37°C, with about 4-40 mU enzyme, 0.5-5 pmol CMP-NANA per ml (100-800 pg 
acceptor protein per ml were used in the exemplified embodiments), and 
preferably 3.6 pmol. Reaction times will vary depending upon the concentration 
and specific activity of the enzyme, but generally 6-10 hours for small scale 
syntheses, and longer for large scale application, are sufficient for the purposes of 
the methods of the invention. 

a- 1,2 Mannosidase (EC 3.2.1.24), such as that from Aspergillus saitoi 
exemplified herein, is commercially available from Oxford Glycosystems, Oxford, 
UK. One unit of enzyme is the amount of enzyme that will release one pmole of 
mannose from bakers yeast mannan per minute at pH 5.0 and 37° C. In the 
exemplified embodiments, 6 punits enzyme were used to treat up to 500 pg 
protein in a 30pl final volume, and 2 punks enzyme were used to treat about 5 pg 
bovine ribonuclease B oligosaccharides (Oxford GlycoSystems Cat No. RP-2500) 
in 10 pi final volume. Preferably the pH is about 5.0 for optimal activity of the 
enzyme from A. saitoi. Reactions are preferably run at 37° C. 

Methods for detection of the products of these reactions are well known 
and exemplified herein. 

IV. Modification of Glycosylation Patterns 

According to the invention, the modification of glycoproteins may be 
effected in vitro by carbohydrate modifying enzymes described above that are 
capable of synthesizing or modifying glycosylation structures. Glycoproteins 
synthesized by yeast, filamentous fungi and insect cells, can be structurally and 
functionally dissimilar to mammalian-type caibohydrate structures in having a high 
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mannose or hybrid-type character. However, as embodied by this invention, these 
high mannose oligosaccharide structures can be enzymatically converted into 
hybrid or complex structure, making the carbohydrate structure on the yeast, 
fungi, or insect produced protein more similar to, or identical to, the mammalian- 
produced carbohydrate structure. 

It has been surprisingly discovered that filamentous fungi such as, for 
example, Trichoderma, unlike yeast, synthesize glycoproteins in an form that is 
recognizable as an acceptor substrate for the enzyme N-acetylglucosaminyl 
transferase I (GlcNAc Tr I), especially human GlcNAc Tr I. This form is an 
"immature" mammalian form. 

However, GlcNAc Tr I is not naturally present in these hosts. According 
to the invention, the above finding is exploited such that the glycosylation pattern 
of a desired glycoprotein, for example, a recombinant protein that was produced 
in Trichoderma, having a structure that can serve as an acceptor substrate for 
GlcNAc Tr I, for example, formula (TV) above, is modified by a cascade of 
sequential reactions, for example three sequential reactions with the following 
enzymes: first, GlcNAc Tr I; second, P-1,4 galactosyl transferase; and third, ct-2,6 
siatyltransferase. The result of this cascade is the production of a glycoprotein 
having at least a hybrid glycosylation pattern, and, when all branch termini are 
modified, a complex-type glycosylation pattern. Such branches are preferably 
terminated with terminal sialic add units, similar to that of mammalian cells, 
although, as desired other terminal units can be used, such as those described 
earlier. 

As described below, a recombinant host can be prepared such that the first 
reaction, that of GlcNAc Tr I, is performed in vivo, in the host, prior to secretion 
of the protein. 

It has also been surprisingly found that treatment of glycoproteins that 
have been synthesized by yeast and Trichoderma, with a-l,2-mannosidase, so as 
to trim (remove) mannose residues from such glycoprotein, provides (in the case 
of yeast) or enhances (in the case of Trichoderma), acceptor substrate sites for 
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GlcNAc Tr I enzymatic action. According to the methods of the invention, a 
mannosidase may be used to pretreat the acceptor substrate prior to reaction with 
GlcNAc Tr I, to create, or otherwise enhance the availability of, target sites of 
GlcNAc Tr I action on such substrate. 

Therefore, in a first embodiment, GlcNAc Tr I activity is used to convert 
an acceptor substrate of a high mannose type to a hybrid-type. The hybrid type 
can then be converted to a different hybrid type, or, a complex type. In a highly 
preferred embodiment, the source of the acceptor substrate for GlcNAc Tr I is a 
glycoprotein that has been produced in a filamentous fungi, and especially 
Trichoderma, and most especially T. reesei . 

It is not necessary to extract the product of each reaction from the mixture 
prior to starting the subsequent reaction. The pH and assay conditions (such as 
the addition of divalent cations) can simply be adjusted in the same container for 
each subsequent reaction so that handling is minimized. Additionally, solid state 
technology can be used to provide the enzymes on a solid support and the 
acceptor substrate amply passed through such support or the solid support can be 
used in batch form. 

The glycoprotein source of the acceptor substrate can be the native or 
recombinant glycoprotein product of any host that provides such glycoprotein in 
useable form. Especially, the glycoprotein is a native or recombinant product of 
a filamentous fungi, for example, a member of the genera Absidia, Acremonium, 
Alternaria, Aspergillus, Aureobasidium, Beauveria, Ceratocystis, Chaetomium, 
Cladosporium, Collectotrichum, Fusarium, Gliocladium, Mortierella, Mucor, 
Paecilomyces, Penicillium, Phanerochaete, Phytophthora, Pythium, Rhizopus, 
Trichoderma, and Verticillium. Most especially, the host is a member of the 
Aspergillus or Trichoderma genera. 

Trichoderma species useful as hosts for the production of substrate 
acceptors of GlcNAc Tr I include T. reesei, such as QM6a, ALK02442 or 
CBS383.78 (Centraalbureau voor Schimmelcultures, Oosterstraat 1, PO Box 273, 
3740 AG Baarn, The Netherlands, or, ATCC13631 (American Type Culture 
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Collection, 12301 Parklawn Drive, Rockville, Maryland, 10852, USA, type); 71 
viride (such as CBS 189.79 (det. W. Gams); T. longibrachiatum, such as 
CBS816.68 (type); T. pseudokoningii (such as MUCL19358; Mycotheque de 
lUniversite Catholique de Louvain); T. saturnisporum CBS330.70 (type); T, 
harzianum CBS3 16.31 (det. W. Gams); T. virgatum (T. pseudokoningii) 
ATCC24961. Most preferably, the host is Z reesei and especially T. reesei strains 
QM9414 (ATCC 26921), RUT-C-30 (ATCC 56765), and highly productive 
mutants like VTT-D-79125, which is derived from QM9414 (Nevalainen, 
Technical Research Centre of Finland Publications 26, (1985), Espoo, Finland). 
The transformation of Trichoderma may be performed by any technique known 
in the art, including the technique taught in European patent application EP 
244,234. 

Preferred species of Aspergillus that are useful hosts in the methods of the 
invention include A. nidulans, A. awamore and A. niger. Methods for the 
transformation of Aspergillus are known. 

As described below, when pretreated in vitro with al,2 mannosidase, yeast 
such as Pichia spp. (especially Pichia pastoris), Hansenula spp. (especially 
Hansenula polymorpha), Kluyveromyces lactis, Yarrowia lipolytics or S. 
cerevisiae are useful hosts for expressing a gene of interest. Yeast would be useful 
as a host for expressing cloned GlcNAc Trl enzyme since it is not glycosylated. 

In one embodiment, the glycoprotein acceptor substrate is secreted into the 
growth medium of the production host and is modified in vitro by the action of 
enzymes such as GlcNAc Tr I without being removed from such medium. If 
desired, the medium can be concentrated prior to enzymatic modification of the 
gene product of interest. Alternatively, the glycoprotein acceptor substrate can 
be provided to the GlcNAc Tr I in a purified or isolated form. 

In a preferred embodiment of the invention, the coding sequence of 
GlcNAc Tr I is transformed into a desired host cell from which it is also desired 
to synthesize the protein of interest, especially, a host cell that does not naturally 
catalyze the enzymatic modifications according to the methods of the invention. 
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In this embodiment, the first step in the modification of glycosylation pattern, the 
reaction catalyzed by GlcNAc Tr I is effected in vivo. The host cell then 
synthesizes, and preferably secretes, an acceptor substrate of a hybrid type. The 
hybrid type is then converted to a hybrid type that contains terminal sialic acids by 
the second and third steps of the cascade in vitro, that is, the addition of galactose 
residues with p-1,4 galactosyl transferase, followed by the addition of sialic acid 
residues with a-2,6-sialyltransferase. 

Preferably, the recombinant host that is transformed with both GlcNAc Tr 
I coding sequences and, if desired, with the coding sequence for a protein of 
interest (the glycosylation pattern of which is to be modified in vivo by the action 
of the expressed GlcNAc Tr I). More preferably, the recombinant host is a 
member of the Aspergillus or Trichoderma genera as described above. 

The recombinant host can also be transformed with DNA comprising the 
a-1 ,2-mannosidase gene. In embodiments of this invention, the recombinant host 
can be transformed with the a-l,2-mannosidase gene together with the GlcNAc 
Tr I coding sequences and, if desired, the coding sequence for a protein of 
interest. In other embodiments of this invention, the recombinant host is 
transformed with the a-l,2-mannosidase gene alone, or, if desired, the coding 
sequence for the protein of interest. 

Vector systems are known that are useful in transforming hosts, especially 
the Aspergillus or Trichoderma hosts described above for the production of the 
GlcNAc Tr I and glycoproteins of the invention. A separate vector may be used 
to provide the selectable marker 

Expression of genes in a host requires the use of regulatory regions 
functional in such hosts. A wide variety of transcriptional and translational regu- 
latory sequences can be employed; filamentous fungi such as Trichoderma 
generally recognize eukaryotic host transcriptional controls, such as, for example, 
those of other filamentous fungi, and especially, Aspergillus. 

In a preferred embodiment, genetically stable transformants of 
Trichoderma are constructed whereby DNA encoding a desired glycoprotein or 
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carbohydrate modifying enzyme like GlcNAc Tr I is integrated into the 
Trichoderma host chromosome. The coding sequence for the desired 
glycoprotein or enzyme may be from any source. Such integration may occur de 
novo within the cell or, in a most preferred embodiment, be assisted by 
transformation with a vector which functionally inserts itself into the host 
chromosome, for example, DNA elements which promote integration of DNA 
sequences in chromosomes. 

After the introduction of the vector, recipient cells are grown in a 
selective medium, which selects for the growth of the transformed cells. 
Expression of the cloned gene sequences) results in the production of the desired 
glycoprotein and/or carbohydrate modifying enzyme, or in the production of 
desired fragments thereof. This expression can take place in a continuous manner 
in the transformed cells, or in a controlled manner, for example, wherein 
expression is the result of a regulated promoter. 

In a preferred embodiment, a desired glycoprotein or carbohydrate 
modifying enzyme is secreted into the surrounding medium due to the presence 
of a functional secretion signal sequence, preferably a signal homologous to the 
host. If a desired gene does not code for a signal sequence, or if its signal 
sequence does not function well in the host, then the glycoprotein's coding 
sequence may be operably linked to another signal sequence, either homologous 
or heterologous to the host, that functions in such host. The desired coding 
sequence may be linked to any signal sequence which will allow secretion of the 
glycoprotein from the selected host, for example, for secretion from Trichoderma, 
the signal sequence of the Trichoderma cellulase enzymes may be used, for 
example, the secretion signal of the cellobiohydrolase I (CBH1) ceUobiohydrolase 
II (CBHII), endoglucanase I (EGI) or endoglucanase II (EGII) protein may be 
used. Secretion signals of the Trichoderma xylanases are also useful. Such signal 
sequences may be designed with or without specific protease sites such that the 
signal peptide sequence is amenable to subsequent removal. 
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In those embodiments wherein GlcNAc Tr I is provided intraceUularily 
within the fungal host cell so as to act on a substrate intracellularly, it is not 
necessary, and not preferable, to secrete recombinantly produced GlcNAc Tr I 
into the medium as it is necessary that it remain in the host cell to be active in vivo 
in those embodiments. 

Trichodermds an especially useful and practical host for the synthesis of 
the glycoproteins of the invention because Trichoderma is capable of secreting 
protein in large amounts, for example, concentrations as high as 40 g/L culture 
medium have been reported; the homologous cellulase and hemicullulase 
promoters such as the Trichoderma cbhJ , cbh2, egll and egll promoters provide 
very convenient promoters for the high expression of genes of interest because 
they are strong promoters. For example, the cbhl promoter is a single copy 
promoter that normally directs the synthesis of up to 60% of the secreted protein 
from the Trichoderma host. Alternatively, the gene of interest, or the GlcNAc Tr 
I gene, can be inserted into the CBHI locus. The Trichoderma transformation 
system is highly versatile and can be adapted for any gene of interest. Culture of 
Trichoderma is supported by previous extensive experience in industrial scale 
fermentation techniques; for example, see Finkelstein, D3. et al., eds., 
Biotechnology of Filamentous Fungi: Technology and Products, Butterworth- 
Heinemann, publishers, Stoneham, MA (1992). 

V. Proteins of Interest 

The methods of the invention are useful with any desired protein of interest 
that contains high mannose-type glycosylation patterns. Such patterns are 
considered to be "immature" when compared to complex or hybrid patterns. 

Especially, a protein of interest may be a protein, such as an 
immunoglobulin or hormone, that is to be provided to a patient in need of same, 
especially for therapeutic reasons. Examples of glycohormones wherein 
glycosylation plays a role in the properties of such hormone, especially the 
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therapeutic properties, include erythropoietin (EPO), human choriogonadotropin 
(HCG), foffitropin (FSH), thyrotropin (TSH) and lutropin (LH). Modification of 
each peptide subunit according to the methods of the invention may be performed. 
Examples of other proteins of interest whose biosynthesis, transport or function 
are altered by N-glycosylation include NCAM, tenascin, thrombospondin, 
fibronectin, hormones, cytokines, (especially interleukin-4 and interferon gamma), 
growth factors, plasma proteins, coagulation factors, soluble receptors, 
immunoglobulins (antibodies), granulocyte-macrophage colony stimulating factor 
(GM-CSF), vascular endothelial growth factor, HTV virus glycoprotein gp 120, 
viral coat proteins, immunoglobulin D, antithrombin Hip, plasminogen, von 
Willebrand factor, fibrinogen, corticosteroid-binding globulin, thyroxine-binding 
globulin, folate-binding protein, fibrinectin, bone and platelet osteonectin, EGF 
receptor, insulin and insulin-like growth fector-I receptors, basic fibroblast growth 
factor receptor, lymphocyte CD2, MHC Class II molecules, glucose transporters, 
erythrocyte band 3 protein, P-2 adrenergic receptors, transferrin receptor, VIP 
receptor, membrane class I MHC protein, vasopressin receptor of LLC-PK1 cells, 
low density lipoprotein receptor, asialoglycoprotein receptor, CD4 protein, 
thyrotropin receptor, PDGF receptor, jack bean concanavalin A, cobra venom 
factor, lactotransferrin, spinach chloroplast coupling factor, submaxillary gland 
mucin, intestinal brush border lactase-phlorizin hydrolase, arctic fish antifreeze 
glycoprotein, LDL receptor, glucophorin A, and P-HCG, and, generally, secreted 
animal proteins and plant glycoproteins secreted by sycamore cells in culture. 
Especially, those proteins that require targetting to specific organs or cells through 
sugar-lectin recognition are useful protein substrates for modification of 
glycosylation patterns according to the invention. 

The method of the invention is especially useful in designing a protein for 
use in genetic therapy; in such therapy, a protein, such as an enzyme, that is 
deficient in a cell or organ, is targeting to such cell or organ by providing such 
protein with an appropriate glycosylation pattern. 
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VI. Preparation of Modified Glycoproteins as Pharmaceutical 
Compositions 

According to this invention, there is provided a method for producing high 
levels of glycoproteins which are desirable for use, for example, in pharmaceutical 
compositions. Such glycoproteins may be obtained directly from the hosts of the 
invention or the culture medium. Further, if desired activities are present in more 
than one recombinant host (such as multisubunit proteins), such preparations may 
be isolated from the appropriate hosts and combined prior to use in the method of 
the invention; alternatively one host can be used to make all subunits of the desired 
protein. 

To obtain the glycoprotein preparations of this invention, the recombinant 
hosts described above which have the desired properties (that is, hosts capable of 
expressing the glycoproteins) are cultivated under suitable conditions, the 
modified glycoproteins are secreted from the Trichoderma hosts into the culture 
medium, and the glycoproteins are recovered from the culture medium by methods 
known in the art. Alternatively, but less desirably the glycoproteins can be 
recovered from the host cells themselves. 

The glycoprotein is recovered from the culture medium or host cells 
themselves by using routine methods which are well known in the art. The 
glycoprotein of the invention may be lyophilized or the glycoprotein otherwise 
concentrated and/or stabilized for storage. The glycoprotein of the invention are 
very economical to provide and use. If the glycoproteins are secreted into the 
culture medium, only the culture medium need be recovered to obtain the desired 
glycoprotein; there is no need to extract an glycoprotein from the Trichoderma 
hosts unless it is no desire to do so. 

If desired, an expressed glycoprotein may be further purified in accordance 
with conventional conditions, such as extraction, precipitation, chromatography, 
affinity chromatography, electrophoresis, or the like. 

The methods of the invention provide for the manufacture of hybrid or 
complex glycosyl groups on recombinant glycoproteins for their use as 



WO 96/21038 



PCT/F196/00002 



32 

pharmaceutical agents in a safer and more cost-effective manner than is currently 
available. The methods of the invention also provide an important benefit in that 
they result in the synthesis of uniform and well-defined glycosyl structures. 
Production of a single glycoform can be critical if it is desired to obtain uniform 
therapeutic pharmacokinetic profiles. 

As would be understood by one of ordinary skill in the art, the 
pharmaceutical compositions of the present invention can be formulated according 
to known methods to prepare such compositions, whereby the glycoproteins with 
mammalian-type oligosaccharides are combined with a pharmaceutical^ 
acceptable exdpient. Suitable excipients and their formulation are described, for 
example, in Remington's Pharmaceutical Sciences (18th ed., Gennaro, A.R., ed.), 
Mack Publishing Co., Easton PA (1990); see especially Part 4, "Testing and 
Analysis," pp. 435-602, and Part 8, "Pharmaceutical Preparations and Their 
Manufacture," pp. 1435-1712). In order to form a pharmaceutically acceptable 
composition suitable for effective administration, such compositions may 
additionally contain an effective amount of salts, buffers, adjuvants or other 
substances which will improve the efficacy of the composition. 

Pharmaceutical methods may be employed to control the duration of 
action. Controlled release preparations may be achieved through the use of 
polymers to complex or absorb the compositions of this invention, their 
equivalents, or their functional derivatives. The controlled delivery may be 
exercised by selecting appropriate macromolecules (for example polyesters, 
polyamino acids, polyvinyl, pyrrolidone, ethylenevinylacetate, methylcellulose, 
carboxymethylceilulose, or protamine sulfate) and the concentration of the 
macromolecules as well as the methods of incorporation in order to control 
release. Another method to control release is to incorporate the compositions of 
the invention into particles of a polymeric material such as polyesters, polyamino 
acids, hydrogels, poly(lactic acid) or ethylene vinylacetate copolymers. 
Alternatively, it is possible to entrap these materials in microcapsules prepared, for 
example, by coacervation techniques or by interfadal polymerization, for example, 
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hydroxymethylceUulose or gelatine-microcapsules and poly(methylmethacylate) 
microcapsules, respectively, or in colloidal drug delivery systems, for example, 
liposomes, albumin microspheres, microemulsions, nanoparticles, and 
nanocapsules or in macroemulsions. Such techniques are disclosed in Remington's 
Pharmaceutical Sciences, supra. 

The pharmaceutical compositions of the present invention can also be 
formulated for administration orally, or parenterally by injection (for example by 
subcutaneous, intramuscular or intravenous injection), by intravenous or other 
infusion, by nasopharyngeal absorption (intranasopharangeally), percutaneously, 
rectally, ocularly or sublingualis Compositions for parenteral administration may 
include sterile aqueous or non-aqueous solutions, suspensions and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Carriers, adjuncts or occlusive dressings can be used to increase skin permeability 
and enhance absorption. Liquid dosage forms for oral administration may 
generally comprise a liposome solution containing the liquid dosage form. 
Suitable forms for suspending liposomes include emulsions, suspensions, 
solutions, syrups, and elixirs containing inert diluents commonly used in the art, 
such as purified water. Besides the inert diluents, such compositions can also 
include wetting agents, emulsifying and suspending agents, or sweetening, 
flavoring, coloring or perfuming agents. 

In one embodiment, the pharmaceutical compositions of the present 
invention may be formulated as a cream, lotion, ointment, or the like, for topical 
administration to the skin. Such compositions may optionally contain wetting 
agents, emulsifying and suspending agents, or coloring or perfuming agents. 

A composition is said to be "pharmacologically acceptable" if its 
administration can be tolerated by the recipient without the production of serious 
side effects. Such an agent is said to be administered in a therapeutically 
"effective" amount or concentration if it produces the desired biologic effect(s). 
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Generally, the dosage needed to provide an effective amount or 
concentration of the composition can be adjusted by one of ordinary skill in the 
art, such as a medical doctor, and will vary depending upon such factors as the 
route of administration and the recipient's age, condition, sex, and extent of 
disease, if any, and other variables. 

The recipient of the proteins modified according to the methods of the 
invention can be any animal into which it is desired to administer such protein, 
including humans, zoo animals, farm animals (bovine, ovine and the like), and pets 
(cats, dogs, birds, etc.). 

The invention is described in more detail in the following examples, These 
examples show only a few concrete applications of the invention. It is self-evident 
for one skilled in the art to create additional similar applications and embodiments 
thereof Hence the examples should not be interpreted to narrow the scope of the 
invention but rather only to clarify the use of the invention. 

Examples 

Example 1 

Preparation ofTrichoderma reesei Glycoproteins 

Glycoproteins from the T. reesei mutant strain RUTC 30 were obtained 
after growth in a minimal medium for the induction of cellulases (Uusitalo et al. 
7. Biotechn 77:35-50 (1991)). As carbon sources for growth, lactose or a 
combination of lactose/cellulose (2% final concentration) were added. The 
method of Bradford, M. M. {Anal Biochem. 72:248-254 (1976)) was used to 
measure the concentration of the proteins with immunoglobulin G as a standard. 
For glycosyitransferase experiments, the secreted proteins were subsequently 
dialyzed against the proper buffer used in the subsequent glycosyitransferase 
assay. A fest change to the new buffer was possible without significant loss of the 
proteins when using Microcon™ ultracentrifugation devices (Amicon) (Blatt 
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et aU Anal Biochem. 26:151-173 (1968)). A mixture of cellulases from a 
second, unknown strain of T. reesei were commercially obtained from Fluka 
Chemie, Buchs, Switzerland. 

Example 2 
Preparation of Oligosaccharides 

Oligosaccharides were prepared according to a method described by 
Verostek et al (Gfycobiology 2:458 (Abstract 1.06) (1992)). Oligosaccharides 
were released from the glycoproteins (1-20 mg/ml) by recombinant PNGase F 
(Tarentino et a/., Biochem 24:4665-4671 (1985)) (Biolabs). Glycoproteins were 
first dissolved in 50 mM sodium phosphate (pH 7.5), 0.5% SDS and 1% p- 
mercaptoethanol, or in 50 mM Tris-HCl, pH 7.0. The proteins were then 
denatured by boiling for 10 minutes. The nonionic detergent nonidet P-40™ was 
added to prevent the inhibition of PNGase F by SDS. The reaction mixture was 
incubated with 1000 units PNGase F at 37°C for 18 hours. After PNGase F 
treatment, the proteins and oligosaccharides were precipitated by adding four 
volumes of acetone at minus 20 °C. The 80% acetone salt- and SDS-containing 
supernatant was discarded, and the pellet was extracted twice with ice cold 60% 
methanol. Methanol was removed from the pooled supernatants after 
lyophilization by evaporation. Further purification of the oligosaccharides was 
obtained by passage through a Bio-Gel P2™ column (27 cm high, 0.5 cm 
diameter; obtained from BIORAD). The presence of oligosaccharides in the 
different collected fractions was traced through orcinol staining (Revelatevrs pour 
la chromatographic en couche mince etsur papier, Merck, p. 91 (1980)). In a 
second round, pooled oligosaccharide-containing fractions were passed over a Bio 
Gel P2™ (Biorad) column to obtain oligosaccharides that were sufficiently pure 
to use in the glycosyltransferase assays. 
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Example 3 

Amplification of GlcNAc Tr I Genomic DNA Sequence 

The coding region of the GlcNAc Tr I gene (Genbank Accession No. 
M55621; EMBL Accession No. HSGLCNAC; Kumar etal, Proc. Natl 
Acad, Sci. USA 57:9948-9952 (1990); all incorporated herein by reference;) 
from colon carcinoma genomic DNA was amplified using PCR and was 
subsequently cloned into the mammalian expression-vector pCAGGS which 
is described in more detail in Table 3 (Figure 2 and described in Niwa et al, 
Gene 705:193-200 (1991)). The human colorectal adenocarcinoma cell line 
DLD-1 was used as the source of genomic DNA. This cell line is available from 
the American Type Culture Collection, Rockville, Maryland, (ATCC CCL 221) 
and is described in Cancer Research 39:1020-1025 (1979)). Plasmid pCAGGS 
was the gift of J. Miyazaki, Univ. Tokyo, Japan. Plasmid pCAGGS is useful for 
highly efficient expression of genes under the control of the p-actin/p-globin 
hybrid promoter and CMV-IE enhancer in various mammalian cells. pCAGGS 
carries a modified chicken P-actin promoter: the splice acceptor sequence of the 
P-actin promoter region is replaced by a rabbit p-globin fragment containing a 3 ' 
part of the second intron and a 5' part of the third exon. The resulting P-actin/p- 
globin hybrid promoter is designated the "AG 11 promoter. 

Genomic DNA was prepared from colon cells, according to the procedure 
described by Maniatis, T., "Molecular Cloning, A Laboratory Manual," 2nd 
edition, Sambrook et al, eds., Cold Spring Harbor Press, Cold Spring Harbor, 
1989. A genomic DNA fragment representing the coding region of the GlcNAc 
Tr I gene was amplified with Pyrococcus furiosus polymerase (Stratagene) and a 
Pharmacia thermal cycler according to the instructions of the manufacturer. The 
N-terminal oligonucleotide primer consisted of the sequence: 

5 '-CC AGGATGCTGAAGAAGC AGTCTGC A-3 ' (SEQ ID NO:l). 

The C-terminal oligonucleotide primer consisted of the sequence: 
5 '-CC AGTCGACAGGTGCTAATTCC AGCTAG-3 ' (SEQ ID NO:2). 
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The amplification protocol consisted of an initial denaturing cycle of 8 minutes at 
96 °C, followed by a 1 minute annealing step at 65 °C and finally a 4 minute 
polymerization cycle at 75 °C. The amplification then switched to a 1 minute 
denaturation, 1 minute annealing and a 4 minute polymerization for 24 additional 
cycles. 

Example 4 

Subcloning the Polymerase Chain Reaction (PCR)-Generaied Fragments 

A fraction of the PCR fragments has a single adenosine nucleotide added 
at the 3' end because of the template-independent terminal transferase activity of 
Pyrococcus furiosus polymerase. This activity was exploited to clone the 
amplified DNA fragments with the efficiency of sticky end cloning. To do this, the 
pUC18 vector (into which the fragment would be linked) was prepared as follows. 
One ng Hind H-digested vector was incubated with 20 units of terminal 
transferase and 10 jiM ddTTP in a final volume of 30 ^1 for 1 hour at 37°C. 
ddTTP was used to assure the incorporation of only one thymidine at the 3' ends 
of the blunt digested vector (Holton et aL 9 Nucleic Acids Research J9(S): 1 1 56 
(1990)). The vector and the PCR fragments were purified using the Gene Clean™ 
kit from Pharmacia. The amplified GlcNAc Tr I coding fragment (200 ng) was 
finally ligated in the pUC18 vector (40 ng) at 12°C for 17 hours. The first part 
of the gene (MUOCQSAGLVLWGAIIJ^AW 

[SEQ ID NO:3]) which coded for secretion and for retention in the medial Golgi 
compartment was replaced with a nucleotide sequence coding for the "prepro"- 
secretion signal-sequence of the yeast mating factor. This nucleotide-sequence 
was isolated from the plasmid pSCGALlMF3 (Belgian Coordinated Collections 
of Microorganisms (BCCM) catalogue of the Laboratorium voor Moleculaire 
Biologie Plasmidencollectie (LMBP) culture collection at University Gent) after 
digestion with Stu I and Xba I restriction enzymes. (Materials listed as being 
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obtained from the LMBP are available to anyone who requests them). The yeast 
mating factor nucleotide sequence that was used is 

ATG AGA TTT CCT TCA ATT TTT ACT GCA GTT TTA TTC GCA GCA TCC TCC 
MET Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 

GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA GAA GAT GAA ACG GCA CAA 
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 

ATT CCG GCT GAA GCT GTC ATC GGT TAC TCA GAT TTA GAA GGG GAT TTC 
He Pro Ala Glu Ala Val He Gly Tyr Ser Asp Leu Glu Gly Asp Phe 

GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG 
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 

TTT ATA AAT ACT ACT ATT GCC AGC ATT GCT GCT AAA GAA GAA GGG GTA 
Phe He Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val 

TCT TTG GAT AAA AGG [ Seq ID NO: 4) 
Ser Leu Asp Lys Arg tSeq ID NO: 5 J 

A map of plasmid pSCGALlMF3 appears as Figure 3 and a more detailed 
description in Table 2. 

To make plasmid pSCGALIMB, plasmid pSCGALlMF2 (available from 
the LMBP collection) was opened with AccI, filled in with Klenow DNA 
polymerase and ligated. In this way, the Hindu and Sail sites of the polylinker are 
lost, a Nrul site in the polylinker is created, and the HindU site inside the prepro 
secretion signal of the a-mating factor 1 becomes unique. 

The blunt ended Stu I side was ligated to Eco 47111-digested GIcNAc Tr 
I in the pUC18 vector. The "prepro-GlcNAc Tr I" coding nucleotide sequence 
was then isolated from the pUC18 vector through an Eco RI, Hind m double 
digest. Subsequently, an Eco RI linker was ligated to the blunted Hind EI site to 
allow insertion in the mammalian pCAGGS vector. Directional cloning was 
carried out with Xba I, Eco RI digested vector and fragment. 

In this expression cassette, the first part of the gene that coded for the 
secretion and Golgi-retention signals, was replaced by a nucleotide sequence 
coding for the "prepro" secretion signal sequence of the yeast a-mating factor 
(Burke et a/., /. Biol Chem. 257:24433-24440 (1992); and Brake, A.J., in Yeast 
Genetic Engineering (Barr et a/., eds.), Buttersworths Pub., Stoneham, MA 
(1989), pp. 269-280). The vector was called pCAMFhGNTIfl (Figure 4). Its 
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construction is diagrammed in Figures 4A, 4B, 4C and 4D and it is described 
in more detail in Table 1. Essentially, the XbaVStul fragment of 
pSCGALlMF3 (above), containing the prepro secretion signal of the a- 
mating factor 1 gene, was ligated to the XbahEco47lll vector fragment of 
pUChGNTI. Because of the template-independent terminal transferase activity 
of Pyrococcusjuriosus polymerase, a fraction of the PCR fragments has a single 
adenosine nucleotide added at the 3' end. We exploited this activity to clone the 
amplified DNA fragments with the efficiency of sticky end cloning. To do this, the 
acceptor pUC18 vector was prepared as follows: 1 fig HindU digested vector 
was incubated with 20 units terminal transferase and 10 \iM ddTTP in a final 
volume of 30 nl for 1 hour at 37°C (Holton and Graham, 1990). The amplified 
GlucNAc Tr I coding fragment (200 ng) was ligated in the treated pUC18 
(Norrandere/a/., Gene 26:101-106 (1983) vector (40 ng) at 12°C for 17 hours. 
This intermediate construct was then digested with HindUL and EcoBI, the sticky 
ends were filled in with Klenow DNA polymerase and EcoRl linkers (5'- 
CGGAATTCCG-3 ') were added. Finally, this EcoRI-EeoRl fragment was 
digested with Xbal and EcoFl and the resulting Xbal-EcoRl fragment was 
inserted into the Xbal-EcoRl digested pCAGGS vector. This expression plasmid 
carries a fusion of the prepro secretion signal of the a-mating factor 1 gene to the 
mature human gene encoding N-acetylglucosaminyltransferase I (GlcNAc Tr I), 
coded into the sense orientation relative to the chicken P-actin promoter that is 
preceded by the CMV-IE enhancer sequence. The nucleotide sequence of the 
human GlcNAc Tr I was obtained from EMBL Accession no. M55621. 

After transient transfection of the vector to COS-7 cells (ATCC CRL- 
1651), the expressed GlcNAc Tr I enzyme was efficiently secreted in Dulbecco 
minimal medium (Dulbecco's modified eagle medium, Dulbecco, R. etaL, Virol. 
8:396 (1959); Smith, ID. etaL, Virol 72:185 (1960); Tissue Culture Standards 
Comm., In Vitro 6:2 and 93 and In Vitro 9:6). The COS supernatant was 
concentrated about tenfold and was used in GlcNAc Tr I activity assays. GlcNAc 
Tr I is enzymatically well-characterized and uses a ManjGlcNAcj oligosaccharide 
as a preferred acceptor substrate (Nishikawa et aL 9 1 Biol Chem. 2tf3(17):8270- 
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8281 (1988)). In general, oligosaccharides with the common formula Rio 
l,6(Mancc l,3Manp 1.4GlcNAcR2) are accepted as substrates for GlcNAc Tr I 
where Rl is one or more mannose residues linked through a bonds; R2 may be 
pl-4(Fucetl-6)GlcNAc-Asn-X, pi-4GlcNAc-Asn-X, pi-4GlcNAc or H). A 
GlcNAc residue derived from UDP-GlcNAc was incorporated in the p 1-2 linkage 
to the mannose residue linked a-1,3 to Man P 1.4GlcNAcR2 (Schachter, H., 
Gfycobiology 7(5):453-461 (1991)). 

Example 5 
The GlcNAc Tr I Assay 

The mammalian vector pCAGGS and "prepro-GlcNAc Tr I" coding 
sequences (pCAMFhGNTIfl) were transfected into COS cells according to the 
protocol ofMcCutchancra/. (J. Nat. Cancer Inst. 47:351-357 (1968)). GlcNAc 
Tr I was secreted in Dulbecco minimal medium without serum. The enzyme was 
concentrated by ultrafiltration in Centricon-30™ centrifugal rnicroconcentraters 
(Amicon). 

GlcNAc Tr I activity was assayed according to a method described by 
Mshikawa et al (J. Biol Chem. 263(1 7):8270-8281 (1988)) as follows. In a total 
volume of 0.030 ml, the following were added: 0.33 mM Man,Gn; 0.1 M 2-[N- 
morphouno]ethanesulfbnic acid (MES) (pH 6. 1); 20 mM MnCl 2 ; 5 mM AMP (as 
pyrophosphatase inhibitor); 0.1 M GlcNAc (as P-N-acetymexose-arninidase 
inhibitor); 0.34 mmol UDP-( 14 C)GlcNAc (Amersham); and 10 ul concentrated 
COS supernatant. Incubation was at 37°C for 1 to 4 hours. The proteins were 
then separated from the oligosaccharides through precipitation by the addition of 
100% ice-cold ethanol to a final concentration of 66%. 

Oligosaccharides were separated on silica 60 high-performance thin-layer 
plates using a solvent containing butanol/acetic acid/water (20: 10: 10; v/v/v). The 
plates were developed twice in the same solvent (Vauhkonen et al., Eur. J. 
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Biochem. 752:43-50 (1985)). Radioactive oligosaccharides were visualized after 
exposure of the TLC plate to an X-ray film for several days. 

Oligosaccharides were also visualized by orcinol staining as follows. A 
mixture of 0.6% orcinol in 10% sulfuric acid and 0.1% Fe(HI)Cl 2 was sprayed on 
top of the TLC plate which was subsequently heated to 100°C for 10 to 15 
minutes. 

Instead of oligosaccharides, where noted, glycoproteins were added to the 
reaction mixture for GlcNAc Tr I. About 10 ng glycoprotein (e.g. ovalbumin) 
was added to the 30 fil reaction mixture. Incubation at 37°C was over a period 
of 6 to 18 hours. The incorporation of radioactive GlcNAc into the 
oligosaccharide units on the proteins, separated on a SDS-polyacrylamide gel 
(Laemmli, U.K., Nature 227:680-685 (1970)), was visualized by autoradiography. 

Example 6 

Galactosyltransferase and Siaiyltransferase Processing 

Additional in vitro modifications of N-acetylglucosamine-bearing 
glycoproteins or oligosaccharides were carried out with p-1,4 galactosyl 
transferase from human milk and ct-2,6 siaiyltransferase from rat liver. Both 
enzymes were obtained from Boehringer Mannheim. 

Before carrying out these processing steps, CBH I was purified from the 
GlcNAc Tr I reaction mixture. A description for the purification of this protein 
is available from Shoemaker et al. 9 Bio/Technology 7:687-690 (1993). Briefly, 
the COS cell medium was passed over a DEAE-sepharose column that had been 
equilibrated in 50 mM sodium-acetate buffer, pH 5. Cellulases (or CBH I alone) 
from T. reesei in a reaction mixture for GlcNAc Tr I treatment, were dialyzed to 
50 mM sodium acetate buffer, pH 5. Then, they were separated from COS 
supernatant proteins through purification on a DEAE Sepharose® column 
(volume: 0.5 ml, about 1 cm in height, 0.5 cm diameter, fast flow resin, 
Pharmacia). Under these conditions, CBH I was retained. 
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A salt gradient (changing the NaCl concentration from 0-to 0. 1 M in 400 
p.1 sodium acetate buffer, pH 5) removed a substantial part of the proteins present 
in the reaction mixture for the in vitro glycosylations, and CBH I was retained on 
the column. The salt concentration was further raised to 0.2 M (in 800 ul), and 
CBH I eluted at 0.15 M NaCl or higher. The salt concentration was further raised 
to 0.5M (by applying 800 ul) in order to elute as much of the CBH I from the 
column as possible. The CBH I-containing fractions were pooled and were 
dialyzed against a 50 mM Tris buffer, pH 8, and concentrated about 10-fold, using 
a Centricon ultracentrifugation device (Amicon). CBH I was then added to the 
reaction mixture for galactosyltransferase: p-1,4 galactosyl transferase required 
a pH of 8-8.5 and the presence of manganese for optimal specific activity 
(Schanbacher et al, J. Biol. Chem. 245:5057-5061 (1970); Yoon et al, 
Glycobiology 2:161-168 (1992)). To the GlcNAc Tr I-treated proteins or 
oligosaccharides were added: 0.17 umol UDP ( I4 C)Gal (285 mCi/mmol) or 
"cold" UDP-Gal; 2 milliunits galactosyltransferase; and MnCl 2 to a final 
concentration of 20 mM in a final volume of 0.030 ml. Galactosylation was 
carried out at 37°C for about 8 hours. 

a-2,6 Sialyltransferase required a pH of 6-6.5 for optimal activity 
(Weinstein et al., J. Biol. Chem. 257:13845-13853 (1982)). To the 
galactosyltransferase-reaction mixture, the following were added: 0.18 fimol 
CMP ( M C)NANA (281 mCi/mmol) (Amersham) to a final concentration of 6 mM; 
2 milliunits ct-2,6 sialyltransferase; and NaCl (to stabilize the sialyltransferase, final 
concentration: 50 mM). The pH was adjusted to 6-6.5 through the addition of 
1MMES. The final volume of the reaction mixture was 0.050 ml. The reaction 
mixture was incubated at 37° C for 8 hours. The incorporation of radioactive 
sugar-residues on glycoproteins was visualized as described above. 

In vitro modifications of glycosylated ribonuclease B, obtained from 
Oxford Glycosystems, were also performed. The glycosylated ribonuclease B was 
incubated with GlcNAc Tr I as described in Example 5. The resulting 
oligosaccharide product is shown in Figure 12, lane 1 . The resulting product was 
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incubated with galactosyl transferase: p-1,4 galactosyl transferase as described 
above to produce *Gal linked to GlcNAc on Man 5 GlcNAc, as shown in Figure 
12, lane 2. This product was incubated with a-2,6 sialyltransferase as described 
above to produce sialic acid linked to *Gal linked to *GlcNAc on commercial 
Man 5 GlcNAc2 as shown in Figure 12, line 3. 

The use of N-glycans, liberated from ribonuclease B, as acceptor 
substrates for GlcNAc Tr I, galactosyltransferase, and a-2,6 sialyltransferase was 
also studied. The N-glycans were liberated as described in Example 2. Following 
liberation from the protein, two GlcNAc-accepting structures are available. 
Therefore, a more complex glycosyiation pattern is observed following sequential 
reaction with GlcNAc Tr I, galactosyltransferase, and a-2,6 sialyltransferase, as 
depicted in Figure 12, lanes 4-6. However, the expected hybrid and complex 
structure were observed. 

Example 7 

Improvement of Hybrid Glycoprotein Structures with &1,2 Mannosidase 

Glycoproteins (up to 100 mg) or oligosaccharides (prepared from about 
100 mg of protein) were treated with2 to 6 flunks ^. saitoi a-1,2 mannosidase 
(Oxford Glycosystems) in 100 mM sodium-acetate (pH 5) for 18 hours at 37°C. 
Glycoproteins were dialyzed to 20 mM MES, pH 6.1, before further processing 
with GlcNAc Tr I. Incorporation of radioactive GlcNAc on oligosaccharides was 
visualized on X-ray film after separation through TLC. Radioactive signals 
originating from ( I4 C)GlcNAc-readues on proteins were visualized as described 
above. 
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Example 8 

Mannosidase and Mannosyltransferase Assays 

Mannosidase activity was assayed in the extracellular medium from T. 
reesei and Aspergillus senior. T. reesei RUTC 30 was grown in minimal medium, 
according to Uusitalo et al. (1991) with 2% lactose as a carbon source. The same 
medium, supplemented with 2% glucose was used in parallel in order to determine 
whether there is an influence of different carbon sources on expression of 
mannosidases. Aspergillus saitoi was grown on minimal medium according to 
Czapek (Onions and Pitt, Appendix: Media. In Hawksworth, D.L. and Kirsop, 
B.E., (Eds.), Living resources for biotechnology. Cambridge University Press, 
Cambridge, UK, pp. 180-199, 1988) with 1% yeast extract and, in parallel 2% 
glucose as the carbon sources. After 4 days fermentation at 28 °C, fungal cells 
were separated from growth medium by filtration over a GF/F glass microfiber 
filter (Whatman). 

The growth medium was further concentrated (70 fold), using centriprep 
and centricon ultracentrifugation devices (M.W. cut off: 10 kD; Amicon). 
Reaction mixtures for assaying mannosidase activities contained in a volume of 20 
ul: 500 ng ANTS labelled high mannose oligosaccharides from ribonuclease B 
(Oxford (Hycosystems); Sodium cacodylate pH 6.1 (60 mM final concentration); 
CaC12 (20 mM) and 15 ul concentrated growth medium Incubation of the 
reaction mixtures was overnight at 37°C. Oligosaccharides were then separated 
on a polyacrylamide gel as described. 

The collected fungal cells were used for preparation of crude extracts: 
after washing with 50 mM sodium cacodylate buffer, the dry cell pellet was 
ground in a mortar using liquid nitrogen. Broken cells were transferred to ice cold 
sodium cacodylate buffer (50 mM, pH 6.1) containing MgC12 (15 mM) and a 
cocktail of protease inhibitors (PMSF, leupeptin, benzamidin, aprotinin in 
concentrations prescribed by the manufacturer (Boehringer Mannheim). Cells 
were left in buffer on ice for 30 minutes. We used differential centrifugation to 
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obtain ISOOOg, 40000g and lOOOOOg membrane pellets. Finally Triton X 100 (2% 
final concentration) was added to solubilize enzymes from the membranes. 
Reaction mixtures for assaying mannosidases were prepared as already described. 
Reaction mixtures for assaying mannosyltransferases contained in a volume of 20 
id: 100 ng ANTS labelled Man,GlucNAc; 10 \xg UDP-mannose; MnCl 2 (20 mM 
final concentration) and 10 pi of the lOOOOOg membrane preparation. The 
reaction mbctures were incubated for 4 hours at 37°C. 

Example 9 

Glycoproteins as Acceptor Substrates for GlcNAc Tr I 

Different glycoproteins were used in GlcNAc Tr I activity assays to 
determine whether they were carrying glycosyi structures that were substrates for 
GlcNAc Tr L Ovalbumin, which was used as a positive control in this experiment, 
has two candidate glycosylation sites, but only one site is actually glycosylated. 
A fraction of the oligosaccharides that are present on ovalbumin is Man 5 Gn 2 (Tai 
et al t J. Biol Chem. 252:6687-6694 (1977)). As negative controls, 51 cerevisiae 
invertase and human transferrin were used. Invertase from S. cerevisiae carries 
high mannose oligosaccharides with Man,Gn 2 being the smallest structure formed 
(Trimble et aL, J. Biol. Chenu 267:9815-9824 (1986)). Transferrin, a human 
glycoprotein, has two complex-type N-glycans (MSrz et a!., Can 1 Biochem. 
60:624-630 (1982)). Most transferrin oligosaccharides are bi-antennar with sialic 
acid as terminal sugar-residues. 

Protons secreted from the T. reesei mutant strain RUTC 30 and cellulases 
from an unknown strain of 71 reesei (commercial source; Fluka, Buchs, 
Switzerland) were studied. 7. reesei mutant strain RUTC 30 has an increased 
capacity to secrete proteins (Ghosh et aL, in Trichoderma reesei Cellulases: 
Biochemistry, Genetics, Physiology & Application (Kubicek et al y eds ), Royal 
Soc. of Chemistry, Cambridge, England (1990), pp. 115-138). The results (Figure 
5) show that radioactive ( u C)GlcNAc was transferred from UDP-C 4 C)GlcNAc 
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to oligosaccharides on ovalbumin and on extracellular secreted proteins from both 
T. reesei strains. These results indicated the presence of trimming enzymes in T. 
reesei which were capable of the in vivo formation of acceptor-oligosaccharides 
for GlcNAc Tr I. 

In addition, the use of other proteins as acceptor substrates for GlcNAc 
Tr I was studied. The presence of a substrate for GlcNAc Tr I was determined 
using the GlcNAc Tr I assay described above in Example 5 by measuring the 
incorporation of radioactive GlcNAc into the glycoprotein by autoradiography of 
12.5% SDS-PAGE gels. Incorporation of radiolabeled GlcNAc was 
demonstrated for a-amylase from Aspergillus oryzae, obtained from Fluka, Buchs, 
Switzerland; a-amylase from Aspergillus niger, also obtained from Fluka; and for 
proteins in a commercial p-galactosidase preparation from Aspergillus oryzae, 
obtained from Sigma, St. Louis, MO, USA. See Figure 11. Incorporation of 
radiolabelled GlcNAc was also demonstrated for papain digested 
cellobiohydrolase I from Trichoderma reesei RUTC 30. Following papain 
digestion, cellobiohydrolase I no longer contains the C-terminal end of the enzyme 
which is heavily O-glycosylated. Van Tilbeurgh et al., FEBS Lett. 204:223-227 
(1986). 

Example 10 

Free Oligosaccharides as Acceptor Substrate for GlcNAc Tr I 

The oligosaccharides from Trichoderma extracellular glycoproteins were 
isolated to further evaluate and confirm that ManjGlcNAcj was the GlcNAc 
accepting N-glycan. Oligosaccharides were enzymatically removed with N- 
glycanase. This enzyme releases all types of N-linked sugar chains through 
cleavage of the grycosylamine linkage to asparagine residues (Tarentino et al, 
Biochemistry 24:4665-4671 (1985)). The oligosaccharides were then tested in a 
reaction mixture containing GlcNAc Tr I. A commercial mixture of 
oligosaccharides ranging from Man,GlcNAc, to Man, GlcNAq obtained from 
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ribonuclease B through hydrazinolysis (Oxford QycoSystems No. RP-2500) were 
used as a positive control for this experiment (Liang et al, J. Biochem. 88:5 1-58 
(1980)). Oligosaccharides that had been enzymaticaUy released from S. cerevisiae 
invertase were used as negative controls. GlcNAc Tr I transferred radioactive 
GlcNAc from UDP-( w C)GlcNAc to fungal ManjGlcNAc^ glycans. This was 
detected after separation of the manipulated oligosaccharides through thin layer 
chromatography (TLC). The position of the labelled fungal oligosaccharides, as 
revealed by autoradiography (Figure 6), coincided with that of the positive 
control. With the T. reesei mutant strain RUTC 30 an additional oligosaccharide 
structure that was larger than Mai^GlcNACj also acted as an acceptor for GlcNAc 
Tr I. This indicated the existence of differences in glycosylation patterns at the 
strain level. 

Example 11 

Galactosyltransf erase and Sialyltransj erase Assays 

Additional in vitro modifications of GIcNAc-bearing glycoproteins or 
oligosaccharides were carried out with the commercial (M,4 galactosyl transferase 
(Boehringer Mannheim) from human milk (Schanbacher et al, J. Biol Chem. 
245: 5057-5061 (1970)) and commercial a-2,6 sialyltransferase (Boehringer 
Mannheim) from rat liver (Weinstein et aL, J. Biol Chem. 257:13845-13853 
(1982)). Galactosylation was demonstrated through the incorporation of 
radioactive galactose on in vitro synthesized, non-labelled, GIcNAc-bearing 
glycosylstmctures. The completion of the synthetic pathway was demonstrated 
through the incorporation of radioactive sialic acid on "cold" galactose that was 
added in a p-1,4 linkage to GlcNAc. Glycoproteins labelled in this manner were 
separated through electrophoresis on a SDS-polyacrylamide gel. During 
electrophoresis, aalylated glycoproteins moved slower compared to non-sialylated 
proteins. This was observed as a shift of the sialylated proteins towards higher 
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molecular weights. To clearly demonstrate this effect, one Trichoderma protein, 
namely cellobiohydrolase I (CBH I), was examined in particular. 

After the QcNAc Tr I processing step, CBH I was purified from the COS 
supernatant-containing reaction mixture by ion-exchange chromatography on 
DEAE Sepharose (Shoemaker et al f Bio/Technology 7:687-690 (1983)). The 
purification step avoided any potential interference by COS supernatant proteins, 
which also can be an acceptor for sialyltransferase. 

An autoradiogram was obtained with CBH I-containing radioactive 
GlcNAc versus radioactive sialylated CBH I (Figure 7). A clean shift in the 
electrophoretic mobility of the sialylated CBH I as compared to the GlcN Ac- 
containing form was shown. This demonstrated that it was possible to modify the 
naturally occurring glycosylstructures on Trichoderma CBH I into hybrid 
structures with terminal sialic add residues. The two final modification steps were 
confirmed on the free oligosaccharides. The incorporation of galactose followed 
by the incorporation of sialic acid, always caused a shift in the molecular weight 
of the oligosaccharide molecules. This was observed as shifts in the mobility of 
the differently manipulated oligosaccharides in TLC. 

Example 12 

Improvement of Hybrid Glycoprotein Structure 
Through the Use of &l f 2 Mannosidase 

When using a fungus such as T reesei to produce heterologous, 
therapeutically valuable proteins, a substantial fraction of the high mannose 
oligosaccharides will not be an acceptor for GlcNAc Tr I. On the other hand, in 
mammalian cells, an a-1,2 mannosidase that is present in the cis-compartment of 
the Golgi apparatus trims most, if not all, high mannose structures to 
ManjGlcNAc2 and thus assures that carbohydrate chains are convertible to the 
complex type (Tulsiani et al, J. Biol Chem. 263:5408-5417 (1988)). 

A commercial a-1,2 mannosidase (Oxford Glycosystems, Oxford, UK) 
from A, saitoi (Yamashita et al % Biochem. Biophys. Res. Comm. 96:1335-1342 
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(1980)) was used to investigate to what extent glycosylation processing resembled 
that of mammalian cells. The effect of preincubation of fungal oligosaccharides 
or glycoproteins with a- 1,2 mannosidase on the in vitro incorporation of GlcNAc 
in a GlcNAc Tr I assay was determined. 

Oligosaccharides from bovine ribonuclease B, from T. reesei extracellular 
proteins were treated with A. saitoi a- 1,2 mannosidase. Treated and non-treated 
N-glycans were processed and visualized by fluorophore labelling of reducing 
saccharides (Figure 8). In this method, the reducing end of saccharides, liberated 
as described above, was reacted with 8-aminonaphthalene-l,3,6-trisulfonic acid 
(ANTS), as described by Jackson, P., "Methods in Enzymology. Guide to 
techniques in Glycobiology", Vol. 230, p. 250-265, Eds. Lennartz, W. & Hart, G. 
(1994)). Basically, 10 \i\ of 0. 1 5 M ANTS in acetic acid-water solution (3:17 v/v) 
and 10 fd of 1 M sodium cyanoborohydride, dissolved in DMSO were added to 
lyophilized oligosaccharides, enzymatically released from 400 jig glycoprotein or 
to 5 fig of bovine ribonuclease B oligosaccharides, commercially obtained from 
Oxford Glycosystems. No prior purification of oligosaccharides on a Biogel P2 
column was necessary. After overnight incubation at 37 °C, ANTS labelled 
oligosaccharides were precipitated through addition of 4 volumes of ice cold 
acetone (at minus 20°C). The oligosaccharides were pelleted through 
centrifugation (for 4 minutes at 10,000 x g). Unreacted ANTS in the acetone 
containing supernatant was removed. Oligosaccharides were redissolved in a 
suitable volume of glycerolrwater (1:4 v/v) and stored at minus 70°C. ANTS 
derivatized oligosaccharides were separated on a 30% acrylamide, 0.8% N,N'- 
methylene-bisacrylamide gel (on the SE 250 Mighty Small II electrophoretic 
apparatus, that is also used for separation of proteins through PAGE). In order 
to obtain "crystal clear" gels, the gel solution was poured between plastic sheets 
that were adhered to the mold plates with water. Stacking of the oligosaccharides 
was allowed at a current of 15 mA for 30 minutes, while separation happened at 
a current of 30 mA for 2.5 hours. As a reference standard, a glucose ladder was 
loaded, obtained after digestion of wheat starch (described by Jackson, P., 
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"Methods in Enzymology. Guide to techniques in Glycobiology", Vol. 230, p. 
250-265, Eds. Lennartz, W. & Hart, G. (1994)). The electrophoretic band 
patterns were viewed on a UV transilluminator and photographed through a red 
filter (Cokin A. 003; Cromofilter SA, Paris, France), using standard black-and- 
white film. A polaroid type 667 film with a speed of ISO3000 and an aperture of 
4.5 required an 8 sec exposure (Figure 8). 

The oligomannose-type N-glycans from ribonuclease B, ranging from 
Man 5 GlcNAc 2 to Ma^GlcNAq, were all converted to Map GlcNA? . This 
agreed with the feet that the biosynthesis of N-tinked sugar chains on ribonuclease 
B is known to be arrested at an intermediary stage of processing, as demonstrated 
by the fact that high mannose structures had not all been converted to 
Man 5 GlcNAc2 and had extra mannose residues only in the a-1,2 linkage. 
Oligosaccharides which had been isolated from a mixture of proteins secreted 
from T. reesei were partly converted to ManjGlcNAc* whereas a substantial part 
of the oligosaccharide pool was not converted. From the starting material, a 
fraction of the oligosaccharides seemed to be smaller than Mai^GlcNACj, while 
another fraction was larger than Mai^GlcNACj. In Fig. 8, only trimming of T 
reesei oligosaccharides is shown. 

The fungal oligosaccharides that had been treated with A. saitoi a-1,2 
mannosidase were then added to a reaction mixture for GlcNAc Tr I. The 
oligosaccharides were analyzed by thin layer chromatography, a-1,2 Mannosidase 
treatment in vitro favored the formation of acceptor substrate for GlcNAc Tr I 
(Figure 9) because much more radioactive GlcNAc was incorporated into the 
fungal oligosaccharides. A small fraction of the S. cerevisiae invertase 
oligosaccharides was also trimmed to the GlcNAc-accepting high mannose 
structure. Ma^GlcNACj glycans isolated from ovalbumin were the positive 
control for this experiment. On the autoradiogram, the position of the labelled 
fungal oligosaccharides coincided with that of the positive control. This 
confirmed the formation of GlcNAcMan 5 GlcNAc2 in the different reaction 
mixtures. 
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With the RUTC 30 Trichoderma strain, a-1,2 mannosidase treatment also 
gave rise to more of a second, larger GlcNAc-accepting oligosaccharide. When 
the mannosidase treatment was repeated on intact glycoproteins (Figure 10) the 
results were analogous with those obtained with free oligosaccharides. That is, 
more labelled GcNAc was transferred to ovalbumin and to 7! reesei glycoproteins 
when they were first pretreated with a-1,2 mannosidase. GlcNAc now was also 
transferred to S. cerevisiae invertase. 

In summary, the method of the invention allows the production of 
mammalian-like, hybrid oligosaccharides on glycoproteins secreted from the 
filamentous fungus T. reesei and its mutant strain Z reesei RUTC 30. The in 
vitro experiments used the following mammalian glycosyltransferases: human 
GlcNAc Tr I produced in COS cells; p-1,4 galactosyl transferase from human milk 
and a-2,6 sialyhransferase from rat liver. Preincubation with a- 1,2 mannosidase 
from A, saitoi substantially improved GlcNAc incorporation into the fungal 
proteins. This embodiment obtained a fungus that synthesized oligosaccharides 
that were convertible to complex mammalian-type structures in the simplest 
possible way. Since the first steps of the complex carbohydrate synthetic pathway 
were executed in vivo in a fungus such as Trichoderma, completion of the 
glycosylation pathway was possible through in vitro enzymatic oligosaccharide 
synthesis and modification. As a result of this invention, large scale stereo- 
controlled oligosaccharide synthesis will be possible in the future (Ichikawa et al 9 
Anal. Biochem 202:215-238 (1992)). The feet that a fungus can be manipulated 
to construct well-defined and uniform glycosyl structures on its secreted proteins 
offers possibilities to investigate the biological significance of oligosaccharide 
diversity and to perform carbohydrate engineering. For certain proteins, the 
production of a single glycoform may be produced to obtain a uniform therapeutic 
profile (Stanley, P., Molecular and Cellular Biology 9:377-383 (1989)). 
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Example 13 

Analysis oftheN-gfycan Profile of T. reesei Glycoproteins by 
Fluorophore Labelling 

A detailed image, both quantitatively and qualitatively, was obtained when 
oligosaccharides were derivatized with the fluorophore 8-aminonapthalene-l,3,6 
trisulphonate and subsequently electrophoreticalh/ separated on a poh/acrylamide 
gel. In Figure 4, we demonstrate the oligosaccharide pattern from T. reesei 
RUTC 30 cellobiohydrolase I, purified after fermentation in minimal medium 
(Nyyssonen et aL, Bio/Technology 77:591-595 (1993)). The pattern obtained 
with a mixture of proteins secreted by the same Trichoderma strain, but in a 
different growth medium (according to Uusitalo et a/., 1991) is also shown. Only 
a minor fraction of oligosaccharides from the starting material seems to be 
ManjGlcNACj. More abundant are Mai^GlcNAc,, Man 7 GlcNAc* Ma^GlcNACj 
and Man,GlcNAc2 oligosaccharides. N-glycans larger than ManyGlcNA^ were 
also found. 

With pure CBH I, oligosaccharides even smaller than ManjGlcNA^ were 
isolated. In general, the Trichoderma sugar chains are clearly much smaller than 
those isolated from Saccharomyces cerevisiae glycoproteins. Since 0-1,2 
mannosidase trimming is incomplete (Figure 8), it is very likely that mannose 
residues are present in linkages different from the a-1,2 linkage due to transfer to 
trimming products by one or more mannosyl transferases. 

Example 14 

Determination ofT. reesei Mannosidase and Mannosyltransf erase 
Activities and Conclusions 

In order to investigate further in detail the resemblance between T. reesei 
glycosyi synthesis and the mammalian system, we tried to localize the mannosidase 
activity responsible for the oligosaccharide panel on the secreted fungal proteins. 
Fust, we checked whether mannosidases were present in the extracellular medium 
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of the fungus, which could eventually be responsible for "postsecretional" 
trimming of the oligosaccharides on other secreted proteins. We developed an 
assay to determine this activity as follows: oligosaccharides from bovine 
ribonuclease B (commercially available as a reference panel from Oxford 
dycosystems) were used as substrate for mannosidases. A complete conversion 
to Man 5 GlcNAc2 is obtained if enough a- 1,2 mannosidase is present. 
MaiisGlcNAcj is farther trimmed if cc-1,3 and/or a- 1,6 mannosidases are present. 
After labelling with ANTS, the ribonuclease B reference oligosaccharides were 
added to 70-fold concentrated extracellular medium from a four day old T. reesei 
culture (with fungal cells removed). A four day old, concentrated culture 
supernatant from Aspergillus saitoi was used as a positive control for this assay: 
it is known that the latter organism secretes different mannosidases (an a- 1,2 and 
ct-l,3(6) mannosidase) when grown on minimal medium containing mannan as the 
carbon source (Keskar, S.S. et al, Biotechnology Letters 75:695-690 (1993)). 
In Figure 7, we demonstrate the result of an overnight incubation of the reaction 
mixtures: the a-1,2 manfiosidase activity from A. saitoi is clearly detectable. 
Most ribonuclease B oligosaccharides are trimmed to ManjGlcNAc^. No or only 
marginal quantities of mannosidase are present in the medium of T. reesei, since 
the ribonuclease reference panel remained untouched. We mention here that 71 
reesei secreted many more protons in its minimal medium (about 300 jig/ml) than 
did A. saitoi (barely detectable, using the Bradford assay). 

Using the same ANTS labelled oligosaccharide panel from ribonuclease B, 
we found intracellular mannosidase activity in crude extracts from 71 reesei, A. 
saitoi, L929 mouse cells and from 5. cerevisiae. We further tried to discriminate 
mainly between mannosidases, active in glycosylation processing in the 
endoplasmic reticulum, the Golgi apparatus and those active in vacuoles. Using 
differential centrifugation or sucrose gradients to separate organelles, prepared 
from T. reesei cells, we did not obtain sufficient resolution of different 
mannosidase activities in different organelles. However, a-1,2 mannosidase 
activity was abundantly present in fractions that were enriched for 
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mannosyltransferase activity. Determination of the mannosyltransferase activity 
was done after adaptation of the assay, described by Nakajima and Ballou (Proc. 
Natl Acad Set USA 72:3912-3916 (1975)). With reaction mixtures containing 
UDP-mannose as sugar-donor, ANTS labelled Ma^GlcNAc as a substrate and 
glycosylation enzymes solubilized from the membrane preparations by Triton X 
100, mannosyltransferase activity was demonstrated by retardation in the 
electrophoretic mobility of the substrate oligosaccharides (results not shown). 

The examples above demonstrate that it is possible to construct 
mammalian-like, hybrid oligosaccharides on glycoproteins secreted from the 
filamentous fungus Trichodermareesei. Mammalian glycosyltransferases, namely 
human N-acetylglucosaminyl transferase I produced in COS cells, P- 1,4 
galactosyl transferase from human milk, and a-2,6 sialyltransferase from rat liver, 
were used to mimic in vitro the glycosylation synthesis pathway as it occurs in 
mammalian cells. Preincubation with a-1,2 mannosidase from Aspergillus saitoi 
substantially improved GlcNAc incorporation on the fungal proteins. With free 
N-glycans isolated from Trichoderma reesei cellulases, it was demonstrated that 
not all sugar chains are convertible to the ideal acceptor substrate for GlucNAc 
Tr I: Maii^cNAc* The majority of the N-linked oligosaccharides resemble the 
high mannose structures present on certain proteins from mammalian cells, ranging 
predominantly from Man^GicNAcj to Mai^GlcNAcj. With pure CBH I, obtained 
after fermentation in Trichoderma minimal medium (NyyssSnen et al. y 1993) a 
substantial fraction of the oligosaccharides seems to be even smaller than 
ManjGlcNAcj. Using another fermentation medium, we now also obtained 
oligosaccharides larger than Mar^GlcNAc,. The electrophoretic mobility of CBH 
I produced in the different media also changed notably. After deglycosylation, 
CBH I produced as described by Nyyssdnen et al y (1993) shifted only slightly 
compared to the original form after electrophoretic separation on a polyacrylamide 
gel. Exhaustive digestion (long incubation time and many units N-glycanase) was 
needed to liberate oligosaccharides. With the growth conditions we used, CBH 
I was easily deglycosylated, giving rise to a pronounced electrophoretic shift 



WO 96/21038 



PCT/FI96/00002 



55 

compared to the original protein (results not shown). The effects, environmental 
factors can have on protein glycosylation are profoundly described by Goochee 
and Monica (Bio/Technology 5:421-427 (1990), Bio/Technology 9:1347-1355 
(1991)). Their reviews strengthen our belief that the growth medium can be the 
cause of the differences in glycosyl patterns found on Trichoderma proteins. 

From the experiments we conducted to elucidate the localization of the 
mannosidase(s) responsible for the small N-glycans on Trichoderma secreted 
proteins. The possibility of secretion of mannosidases that could act on the other 
secreted fungal protons seems very unlikely. Even with very concentrated growth 
medium and after a long incubation time, no trimming at all of substrate 
oligosaccharides occurred. 

We presume that processing in the Trichoderma Golgi apparatus 
resembles partly that of mammalian cells and partly that of yeast: Mannosidases 
capable of trimming back the Trichoderma N-glycans to ManjGlcNAcj seem to 
be present. On the other hand, mannosyltransferases capable of transferring 
mannose residues in linkages different from the a- 1,2 linkage, are also present. 
The latter conclusion was confirmed after carrying out mannosyltransferase 
assays, analogous to the one described by Nakajima and Ballou (1973): using 
ANTS labelled Man,GlcNAc as a substrate, we managed to demonstrate 
incorporation of up to two mannose residues. On the other hand, Man 5 GlcNAc 
was no substrate for mannosyl transferases. Hence, we wonder if Mar^GlcNAc^ 
the crucial structure for synthesis of complex sugar structures, will not be a 
substrate for mannosyl transferase^) *in vivo" in T. reesei. 

It is not yet known how similar glycosylation synthesis in Trichoderma 
reesei proceeds to that in mammalian cells. Therefore it is difficult to evaluate 
whether this organism can be easily manipulated to have more, if not all, of the 
glycosyl groups converted to mammalian-like structures. The in vitro experiments 
are an introductory, first step to manipulation of this fungus, so that it synthesizes 
oligosaccharides that are convertible to complex structures in the simplest possible 
way. If the first steps of the complex carbohydrate synthesis pathway would 
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occur in vivo in a fungus, such as Trichoderma, completion of the glycosylation 
pathway could be carried out by in vitro enzymatic oligosaccharide modification 
on proteins. We believe that large scale, stereo-controlled oligosaccharide 
synthesis will be possible in the future: much research is in progress to reduce the 
high expenses associated with these in vitro processing steps (Ichikawa et aL, 
1992). Finally, another important aspect concerning this research can be 
mentioned: the fact that a fungus can be manipulated to synthesize well-defined 
and uniform glycosyl structures on its secreted proteins. For certain proteins, the 
production of single glycoform may be desired to obtain an uniform therapeutic 
profile. 

Example IS 

Transformation ofT. reesei with GlcNAc Tr I coding sequences 
and Expression of Modified Glycoprotein from the Trichoderma Host 

A DN A vector is constructed that contains the coding sequence of human 
GlcNAc Tr I (Genbank Accession No. M55621; EMBL ID No. HSGLCNAC) 
as described above, and such that the DNA providing coding sequence for the 
GlcNAc Tr I enzyme is operably linked to the CBHI promoter and terminator. 
Such vector is transformed into T. reesei, using the technique described in EP 
244,234, and positive transformants selected. 

One of the positive transformants is further transformed with a second 
vector, encoding ovalbumin in a form that was secreted from the Trichoderma 
host, and transformants were selected that expressed both GlcNAc Tr I and 
ovalbumin. 

Ovalbumin that is secreted into the Trichoderma medium is already 
processed to a first hybrid form by the intracellular action of GlcNAc Tr L This 
ovalbumin is further processed using p-l,4-galactosyltransferase and a-2,6- 
sialyltransferase as described in Example 1 1 . 
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Example 16 

Transformation ofT. reesei with GlcNAc Tr I coding sequences 
and Expression of Modified Glycoprotein from the Trichoderma Host 



As in Example 15, except that the ovalbumin is processed with non- 
specific cc-mannosidase after treatment with p-l,4-galactosyI transferase, or after 
treatment with a-2,6-sialyltransferase. 



Example 17 

Transformation ofT. reesei with GlcNAc Tr I coding sequences 
and Expression of Modified Glycoprotein from the Trichoderma Host 



As in Example 15 or Example 16, except that the gene of interest encodes 
erythropoietin (EPO), human choriogonadotropin (HCG), follitropin (FSH), 
thyrotropin (TSH), lutropin (LH) NCAM, tenascin, thrombospondin, fibronectin, 
hormones, cytokines, interleukin-4, interferon gamma, growth factors, plasma 
proteins, coagulation factors, soluble receptors, an antibody, granulocyte- 
macrophage colony stimulating factor (GM-CSF), vascular endothelial growth 
factor, HIV virus glycoprotein gpl20, viral coat proteins, immunoglobulin D, 
anthhrombin IDp, plasminogen, von Willebrand factor, fibrinogen, corticosteroid- 
binding globulin, thyroxine-binding globulin, folate-binding protein, fibrinectin, 
bone and platelet osteonectin, EGF receptor, insulin and insulin-like growth 
fector-I receptors, basic fibroblast growth factor receptor, lymphocyte CD2, MHC 
Class II molecules, glucose transporters, erythrocyte band 3 protein, P-2 
adrenergic receptors, transferrin receptor, VIP receptor, membrane class I MHC 
protein, vasopressin receptor of LLC-PK1 cells, low density lipoprotein receptor, 
asialoglycoprotein receptor, CD4 protein, thyrotropin receptor, PDGF receptor, 
jack bean concanavalin A, cobra venom factor, lactotransferrin, spinach 
chloroplast coupling factor, submaxillary gland mucin, intestinal brush border 
lactase-phlorizin hydrolase, arctic fish antifreeze glycoprotein, LDL receptor, 
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glucophorin A or p-HCG, a peptide subunit of any of the above, or a functional 
fragment of any of the above. 

Example 18 
Modification of Proteins Produced in Yeast 

A protein of interest, especially one described in Examples 15 and 17 is 
produced in a yeast selected from Pichia spp (especially Pichia pastoris), 
Hansenula spp (especially Hansermla polymorpha), Khyveromyces lactis, 
Yarrowia lipolytics, or £ cerevisiae and treated as described in Example 12 with 
an cc-1,2 mannosidase prior to sequential treatment with GlcNAc Tr I, 0-1,4 
galactosyl transferase and a2,6-siaryl transferase. 
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Table 1 

Datasheet: pCAMFhGNTIfl 
Host/Plasmid Information 

Storage No: 3124 
Host Bacterium type: MC1061 
Helper Plasmid: 
Host/Plasmid Resistance: amp 
Temperature: 37 °C 
Additional Info: 



Plasmid Description 

CG: preMFcr1/hGlcNAc-Tl (fusion of the prepro secretion signal sequence of the S.cerevisiae cr-mating 
factor 1 gene to the human cDNA fragment encoding mature N-acetylglucosaminyitransferase I) 

SN: 3124 

RU: Restricted Use 

PG: EU-VARIA 
SF: p3124.SEQ 



Code 


Start 


Stop 


Description 


Comments 


LEN 


6230U 






LENGTH PLASMID 


CLO 


1D 


170D 


rGlBf 


PART EXON 3 ♦ 3UTR ♦ polyR Or RABBIT P-GLOBIN 


CLO 


171 D 


529D 


3FR 


3* FLANKING REGION OF RABBIT 3-GlOBIN GENE 


PRO 


689D 


605O 


lac 


PROMOTER 


ORI 


747D 


948D 


SVORI 


SV40 FRAGMENT, CONTAINING BIDIRECTIONAL ORI 


SIG 


S51D 


10250 


polyA 


polyA SIGNAL OF THE SV40 EARLY REGION 


SIG 


1C25D 


951D 


poryA 


polyA SIGNAL OF THE SV40 LATE REGION 


ORI 


1540U 


1140U 


ORI 


ORIGIN PLASMID 


SEL 


2943D 


2083D 


AMP 


ANTIBIOTIC RESISTANCE GENE 


CLO 


30H3D 


3460D 


hCMV 


hCMV-IE ENHANCER 


PRO 


3472U 


4705D 


Ac 


CHICKEN p-ACTIN PROMOTER 


SIG 


4721 D 


4975D 


preMF 


S.cerevisiae a-MATING FACTOR t PREPRO SEQ. 
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GEN 4576D 61 90D GNTlf hGlcNAc-TI FRAGM. CONTAINING MATURE SEQ. 

HC: E.coli 

Mammalian cells (COS cells for transient expression; others for permanent transformation) 

155: 

OR: Eco-pMB1 
SV40 

PC: p CAGGS;pSCGAL1MF3;pUChGNTI1 
PR: hCMV-IE (promoter enhancer) 

CHI-Ac 

Eco-lac 
RB : Eco-lacZ 

RF: Kumar et al., Proc. Natl. Acad. Sci. USA 87 (1 990), 9948-9952 
Marleen Maras and Roland Contreras 

RK: Start of the nucleotide sequence in the middle of the EcoRI site. 

This plasmid was made as follows: 1) The Xbal-Stul fragment of pSCGAL1MF3, containing the 
prepro secretion signal sequence of the S.cerevisiae a-mating factor 1 gene, was ligated to the 
Xbal-Eco47III vector fragment of pUChGNTH ; 2) This intermeidate construction was then digested 
with Hindlll and EcoRI, the sticky ends were filled in with Klenow DNA polymerase and EcoR! 
linkers (5-CGGAATTCCG-3') were added; 3) Finally, this EcoRl-EcoRl fragment was digested with 
Xbal and EcoRI and the resulting Xbat-EcoRI fragment was inserted into the Xbal-EcoRI digested 
pCAGGS vector. 

This expression plasmid carries a fusion of the prepro secretion signal sequence of the 
S.cerevisiae a-mating factor 1 gene to the mature human gene encoding N- 
acetyiglucosaminyttransferase I, cloned into the sense orientation relative to the chicken p-actin 
promoter which is preceded by the CMV-IE enhancer sequence. 

The nucleotide sequence of the human GlcNAc-TI gene was obtained from the EMBL Nucleotide 

Sequence Database (Accession number M55621). 
RS: IMJJII, U-AlwNI, U-Apal, LMvril, U-Bbsl, U-Bcgl, U-BsaAl, U-Bsml, U-Bsp1 201 , U-Drdl, U-EcoRi, 

U-Fspl, U-Hindlll, U-Maml, U-Mstll, U-Nsil, U-Pf)MI, U-Pvui, U-Sfit, U-Smai, U-SnaBI, U-Snal, U- 

Spel, U-Sphl, U-Sse8387l, U-Sspl, U-Xbal, U-Xcml, U-Xmal 
SE: amp 

SQ: /pCAGGS/1 3/4722/p21 54/724/460/hGNTl/285/1 505/TCGACTGG AV 
/pUC1 8/41 9/400/CGGAA;// 

IEI 

VS: SHUTTLE 



SD: 
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Table 2 

Datasheet: pSCGALlMF3 

Host/Plasmid Information 

Storage No: 3123 

Host Bacterium type: MC1061 

Helper Plasmid: 

Host/Plasmid Resistance: amp 

Temperature: 37 'C 

Additional Info: 



Plasmid Description 

CG: preMF c1 (prepro secretion signal of the S.cerevisiae a-msting factor 1 gene) 

SN: 3123 

RU: - 

PG; YEAST 

SF: p3123.SEQ 

EI: yji 



Code 


Start 


Stop 


Description 


Comments 


LEN 


3477U 






LENGTH PLASMID 


GEN 


451D 


2380 


lacZ 


tacZor DMA 


SIG 


71 5D 


452D 


preMF 


S.cerevisiae a-MATING FACTOR 1 PREPRO SEQ. 


FRO 


1211U 


8280 


GAL1 


PROMOTER 


FRO 


1352U 


1299D 


lac 


PROMOTER 


GRI 


18S0U 


1470U 


ORl 


ORIGIN PLASMID 


SEL 


3277D 


2417D 


AMP 


ANTIBIOTIC RESISTANCE GENE 



HC: E.coii 

OR: Eco-pM31 
PC: pSCGAL1MF2 
PR: Sce-GAL1 

Ecc-lac 
BSl Ecc-lacZ 
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PF: Peter impe and Roland Contreras 

RK ; start of the nucleotide sequence between the ampicillin resistance gene and the lacZ DNA. 

This plasmid was made as follows: pSCGAL1MF2 was Accl opened, filled in with Klenow DNA 
polymerase and ligated. In this way the Hindi and Sail sites of the polylinker are lost, an Nrul site 
in the polylinker is created, and the Hindil site in the prepro secretion signal of the S.cerevisiae 
a-mating factor gene 1 becomes unique. 
U-Maml cust only in dam-negative strains. 

RS: E-Asp718, E-BspMI, E-Hindlll, E-Kpnl, E-Nrul. E-Sphl, E-Sse8387l, E-Stul 

U-Aatll, U-Aflill, U-Agel, U-AlwNI, U-Asp700, U-Bbel, U-Bcgl, U-Bsal, U-BspMIl, 
U-Eam1105l, U-EcoO109, U-Ehel, U-Hindll, U-Kasl, U-Maml, U-Narl, U-Ndel, 
U-Pssl, U-Scal, U-Sspl 

SE: amp 

SQ: /p2154/1/432/p2154/431/3475// 
TE: 

VS: PLASMID 
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Table 3 

Datasheet: pCAGGS 

Host/Plasmid Information 

Storage No: 2453 
Host Bacterium type: MC1061 
Helper Plasmid: 
Host/Plasmid Resistance: amp 
Temperature: 37 *C 
Additional Info: 

Plasmid Description 

SN: 2453 

RU: Restricted Use 

PG: EU-VARIA 
SF: pCAGGS.SEQ 

DT: LU; 



EL: 



Code 


Start 


Stop 


Description 


Comments 


LEN 


481 eu 






LENGTH PLASMID 


CLO 


10D 


1B2D 


rGISf 


PART EXON 3 ♦ 3UTR ♦ pofyR Or RABBIT [J-GLOBIN 


CLO 


1B3D 


541D 


3FR 


3' FLANKING REGION OF RABBIT fJ-GLOBIN GENE 


PRO 


701 U 


618D 


lac 


PROMOTER 


ORI 


859D 


961D 


SVORI 


SV40 FRAGMENT, CONTAINING BIDIRECTIONAL ORI 


SIG 


953D 


1C37D 


polyA 


polyA SIGNAL OF THE SV40 EARLY REGION 


SIG 


1037D 


952D 


polyA 


potyA SIGNAL OF THE SV40 LATE REGION 


OR! 


1555U 


1145U 


ORI 


ORIGIN PLASMID 


S=L 


2955D 


2095D 


AMP 


ANTI3IOTIC RESISTANCE GENE 


CLO 


3055O 


34720 


CK'V 


CMV-IE ENHANCER 


PRO 


346iU 


481SD 


AG 


0-ACTIN/O-GLOBIN HYBRID PROMOTER: SEE RK 



HC: E.COLI 

Mammalian cells (COS-cells for transient expression; others for permanent transformation) 
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OR: Eco-pMB1 
SV40 

PC: p uci 3; pAGS-3 (neither of them is present in the physical stock) 
PR: CHI/RAB-AG 

hCMV-IE (promoter enhancer) 

Eco-Iac 
RB: Eco-lacZ 

RF: This plasmid was a gift from Prof. J. Miyazaki (University of Tokyo, Japan). 
Niwa et al., Gene 108 (1991). 193-200 
Peter Vanhoenacker 

RK: Start of the nucleotide sequence in the middle of the non-unique EcoRI site. 

The nucleotide sequence of the plasmid was reconstructed according to the information in the 
publication of Niwa et al.. 1991 . The following restriction sites were experimentally verified (Marc 
Van de Craen and Vera Goossens): Bglll. BsaAI. Hindll. Mstll, Ncol. Sspl. Pvull. Xbal and Xhol. 
This plasmid is useful for highly efficient expression of genes under the control of the chicken 0- 
actin/rabbit 0-globin hybrid promoter + CMV-IE enhancer in various mammalian cells. 
pCAGGS carries a modified chicken P-actin promoter: the splice accepter sequence of the P-actin 
promoter region is replaced by a rabit p-globin fragment containing a 3' part of the second intron 
and a 5' part f the third exon. The resulting 0-actin/(3-globin hybrid promoter is designated the AG 
promoter. 

U-Bcll cuts only in dam-negative strains. 

RS: U-Accl. U-Apal, U-Asp700. U-Ball. U-Bcgl, U-Bglll. U-BsaAI. 

U-Bsal. U-Bsml. U-Bsp120l. U-BstXI. U-Drdl, U-Eagl. U-Eam1105l. 
U-Espl. U-Fspl, U-Hindll. U-Hindlll. U-Mstll. U-Nsil, U-PflMI. 
U-Pstl, U-Pvul. U-Sacll. U-Sall, U-Scal, U-SfaNl. U-Sfil, 
U-SgrAI. U-SnaBI. U-Spel, U-Sspl. U-Stul, U-Xbal. U-Xhol 



SQ: 



SE: amp 

mCCTCGAGGAA-/RGIobinB/1545/2069rNNNNC7 P UC13/427/624rNNNNNNNNNNN7 
/ P AGS-3/350/4fN7pUC1 3/625/261 1/-NNNNNGTC7pCDM8/281 5/2435/-NNNNNGTC7 
/pAGS-3/2998/1664// 

IE: 

VS: SHUTTLE 



SP: 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT! 

(ii) TITLE OP INVENTION: Methods of Modifying Carbohydrate 
Moieties 

(iii) NUMBER OF SEQUENCES: 5 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: 

(B) STREET: 

(C) CITY: 

(D) STATS: 

(E) COUNTRY: 

(F) ZIP: 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 

(B) TELEFAX: 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
CCAGCATGCT GAACAACCAC TCTGCA 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS? 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 
(0) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE; DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCAGTCGACA CCTGCTAATT CCAGCTAG 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE5S: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Leu Lye Lye Gin Ser Ala Gly Leu Val Leu Trp Gly Ala He Leu 
1 5 10 15 

Phe Val Ala Trp Asn Ala Leu Leu Leu Leu Phe Phe Trp Thr Arg Pro 
20 25 30 

Ala Pro Gly Arg Pro Pro Ser Val Ser 
35 40 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..255 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 

ATG AGA TTT CCT TCA ATT TTT ACT CCA CTT TTA TTC GCA GCA TCC TCC 48 
Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
15 10 15 

GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA GAA GAT GAA ACG GCA CAA 96 
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 

ATT CCG GCT GAA GCT GTC ATC GGT TAC TCA CAT TTA GAA GGC GAT TTC 144 
lie Pro Ala Glu Ala Val lie Gly Tyr Ser Asp Leu Glu Cly Asp Phe 
35 40 45 

GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG 192 
Asp Val Ala val Leu Pro Phe Ser Asn Ser Thr Asn Acn Gly Leu Leu 
50 55 60 

TTT ATA AAT ACT ACT ATT GCC AGC ATT CCT GCT AAA GAA GAA GGG GTA 240 
Phe lie Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

TCT TTG GAT AAA AGG 255 
Ser Leu Asp Lys Arg 
85 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY £ linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5; 

Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
X 5 10 15 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 

He Pro Ala Glu Ala Val He Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe He Asn Thr Thr He Ala Ser lie Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Asp Lys Arg 
85 
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1 . A process for producing a hybrid glycoprotein, said method comprising: 

(a) culturing filamentous fungi so as to produce a high mannose type glycoprotein 
that is an acceptor substrate for GlcNAc Tr I; 

(b) reacting said glycoprotein with GlcNAc Tr I; 

(c) reacting the glycoprotein product of step (b) with P-1,4 galactosyl transferase; 

(d) reacting the glycoprotein product of step (c) with a-2,6-sialykransferase. 

2. The process of claim 1, wherein said glycoprotein is a protein that is homologous to said 
fungi. 

3. The process of claim 1, wherein said glycoprotein is a protein that is heterologous to said 
fungi. 

4. The process of claim 1, wherein said fungi is a Trichoderma. 

5. The process of claim 4, wherein said Trichoderma is Trichoderma reesei. 

6. The process of any one of claims 1-5, wherein step (b) occurs intracellular^ in said host, 
and steps (c) and (d) are performed in vitro. 

7. The process of any one of claims 1-5, wherein steps (b), (c) and (d) are performed in vitro. 

8. The process of claim 7, wherein said glycoprotein is reacted with ct-1,2 mannosidase 
between steps (a) and (b). 

9. A process for producing a hybrid glycoprotein, said method comprising: 

(a) reacting a high mannose type glycoprotein with an o-l,2-mannosidase; 

(b) reacting the glycoprotein product of step (a) with GlcNAc Tr I; 

(c) reacting the glycoprotein product of step (b) with P-1,4 galactosyl transferase; 

(d) reacting the glycoprotein product of step (c) with o-2,6-sialyltransferase. 
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10. The process of claim 9, wherein step (a) occurs intracellular^ in a fungus expressing said 
high mannose type glycoprotein and expressing said a-l,2-mannosidase; and further wherein steps 
(b), (c), and (d) are performed in vitro. 

11. The process of claim 9, wherein steps (a) and (b) occur intracellularly in a fungus 
expressing said high mannose type glycoprotein, said a-l,2-mannosidase, and said GlcNAc Tr I; 
and further wherein steps (c) and (d) are performed in vitro. 

12. The process of claims 9-1 1, wherein said high mannose glycoprotein was produced in a 
yeast. 

13. The process of claim 12, wherein said yeast is selected from the group consisting of a 
member of the Pichia spp., Hansenula spp. f Kluyveromyces lactis, £ cerevisiae, and Yarrowia 
lipolytica. 

14. The process of claim 13, wherein said Pichia spp. is Pichia pastoris. 

15. The process of claim 13, wherein Hansenula spp. is Hansenula polymorpha. 

16. The process of claim 13, wherein said yeast is S. cerevisiae. 

17. The process of claims 9-1 1, wherein said high mannose glycoprotein was produced in 
filamentous fungi. 

18. The process of claim 17, wherein said fungi is a Trichoderma. 

1 9. The process of claim 18, wherein said Trichoderma is Trichoderma reesei. 
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